Infinity (database)
Field | Value |
---|---|
Developer(s) | InfiniFlow |
Written in | C++ |
Available in | English |
License | Apache 2.0 |
Infinity is an AI-native database that provides a wide range of search capabilities over rich data types such as vectors, full text, and structured data. It supports a variety of large language model (LLM) applications, including search, recommender systems, question answering, conversational AI, copilots, content generation, and other retrieval-augmented generation (RAG) applications.[1][2][3][4]
Key features
Infinity offers high performance, flexibility, ease of use, and a set of features designed to address the challenges facing next-generation AI applications:
Incredibly fast
- Achieves a query latency of 0.1 milliseconds on million-scale vector datasets.
- Sustains up to 10,000 queries per second (QPS) on million-scale vector datasets.
Fused search
Supports fused search that combines multiple embeddings with full-text search, in addition to filtering.
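As an illustration of what fusing several result lists can look like, the sketch below combines a vector ranking and a full-text ranking with reciprocal rank fusion (RRF), then applies a filter. RRF is a commonly used fusion method chosen here for illustration; the cited sources do not state which fusion method Infinity uses. The function `reciprocal_rank_fusion`, the constant `k=60`, and the document IDs are all hypothetical and are not part of Infinity's API.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60, allowed=None):
    """Fuse several ranked lists of document IDs into one ranking.

    Illustrative only: not Infinity's API or its documented algorithm.
    rankings: list of lists, each ordered from best to worst match.
    k: smoothing constant (60 is a conventional default for RRF).
    allowed: optional set of IDs that pass a structured filter.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            if allowed is not None and doc_id not in allowed:
                continue  # drop documents rejected by the filter
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by ID for determinism.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical result lists from two retrieval paths.
vector_hits = ["d3", "d1", "d2"]      # nearest neighbours by embedding
full_text_hits = ["d1", "d4", "d3"]   # keyword matches

fused = reciprocal_rank_fusion([vector_hits, full_text_hits])
filtered = reciprocal_rank_fusion(
    [vector_hits, full_text_hits], allowed={"d1", "d2", "d3"}
)
print(fused)
print(filtered)
```

Documents that appear near the top of both lists, such as `d1` in this example, accumulate the highest fused score, which is the intuition behind combining embedding-based and full-text retrieval.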
Rich data types
Supports a wide range of data types, including strings, numeric values, vectors, and more.
Ease of use
- Provides an intuitive Python API.
- Ships as a single binary with no dependencies, which simplifies deployment.
References
- [1] "AI-Native Database, Powering the Next-Gen RAG for LLM". December 26, 2023. https://medium.com/@infiniflowai/ai-native-database-powering-the-next-gen-rag-for-llm-da70cabcac1a.
- [2] "Why the era of RAG calls for a new AI database". January 10, 2024. https://medium.com/@infiniflowai/why-the-era-of-rag-calls-for-a-new-ai-database-87ee09588710.
- [3] "Looking to the future of vector databases". February 5, 2024. https://medium.com/@infiniflowai/looking-to-the-future-of-vector-databases-be86f218dc94.
- [4] "Language models: what are they, why are they talked about so much, and what future will they have?". January 7, 2024. https://www.repubblica.it/tecnologia/2024/01/07/news/modelli_di_linguaggio_cosa_sono_perche_se_ne_parla_tanto_e_che_futuro_avranno-421823928/.
Original source: https://en.wikipedia.org/wiki/Infinity (database).