Activeloop Deep Lake
Activeloop Deep Lake is a multi-modal vector store that stores embeddings and their metadata, including text, JSON, images, audio, video, and more. It saves the data locally, in your cloud, or on Activeloop storage. It performs hybrid search over embeddings and their attributes.
Alibaba Cloud OpenSearch
Alibaba Cloud OpenSearch is a one-stop platform to develop intelligent search services. OpenSearch was built on the large-scale distributed search engine developed by Alibaba. OpenSearch serves more than 500 business cases in Alibaba Group and thousands of Alibaba Cloud customers. OpenSearch helps develop search services in different search scenarios, including e-commerce, O2O, multimedia, the content industry, communities and forums, and big data query in enterprises.
AnalyticDB
AnalyticDB for PostgreSQL is a massively parallel processing (MPP) data warehousing service that is designed to analyze large volumes of data online.
Annoy
Annoy (Approximate Nearest Neighbors Oh Yeah) is a C++ library with Python bindings to search for points in space that are close to a given query point. It also creates large read-only file-based data structures that are mmapped into memory so that many processes may share the same data.
Atlas
Atlas is a platform by Nomic made for interacting with both small and internet scale unstructured datasets. It enables anyone to visualize, search, and share massive datasets in their browser.
AwaDB
AwaDB is an AI-native database for the search and storage of embedding vectors used by LLM applications.
Azure Cognitive Search
Azure Cognitive Search (formerly known as Azure Search) is a cloud search service that gives developers infrastructure, APIs, and tools for building a rich search experience over private, heterogeneous content in web, mobile, and enterprise applications.
BagelDB
BagelDB (Open Vector Database for AI) is like GitHub for AI data.
Cassandra
Apache Cassandra® is a NoSQL, row-oriented, highly scalable, and highly available database.
Chroma
Chroma is an AI-native open-source vector database focused on developer productivity and happiness. Chroma is licensed under Apache 2.0.
ClickHouse
ClickHouse is the fastest and most resource-efficient open-source database for real-time apps and analytics, with full SQL support and a wide range of functions to assist users in writing analytical queries. Recently added data structures and distance search functions (like L2Distance), as well as approximate nearest neighbor search indexes, enable ClickHouse to be used as a high-performance and scalable vector database to store and search vectors with SQL.
DashVector
DashVector is a fully-managed vectorDB service that supports high-dimension dense and sparse vectors, real-time insertion and filtered search. It is built to scale automatically and can adapt to different application requirements.
Dingo
Dingo is a distributed multi-mode vector database, which combines the characteristics of data lakes and vector databases, and can store data of any type and size (Key-Value, PDF, audio, video, etc.). It has real-time low-latency processing capabilities to achieve rapid insight and response, and can efficiently conduct instant analysis and process multi-modal data.
DocArray HnswSearch
DocArrayHnswSearch is a lightweight Document Index implementation provided by DocArray that runs fully locally and is best suited for small- to medium-sized datasets. It stores vectors on disk in hnswlib, and stores all other data in SQLite.
DocArray InMemorySearch
DocArrayInMemorySearch is a document index provided by DocArray that stores documents in memory. It is a great starting point for small datasets, where you may not want to launch a database server.
Elasticsearch
Elasticsearch is a distributed, RESTful search and analytics engine, capable of performing both vector and lexical search. It is built on top of the Apache Lucene library.
Epsilla
Epsilla is an open-source vector database that leverages advanced parallel graph traversal techniques for vector indexing. Epsilla is licensed under GPL-3.0.
Faiss
Facebook AI Similarity Search (Faiss) is a library for efficient similarity search and clustering of dense vectors. It contains algorithms that search in sets of vectors of any size, up to ones that possibly do not fit in RAM. It also contains supporting code for evaluation and parameter tuning.
Hologres
Hologres is a unified real-time data warehousing service developed by Alibaba Cloud. You can use Hologres to write, update, process, and analyze large amounts of data in real time.
LanceDB
LanceDB is an open-source database for vector search built with persistent storage, which greatly simplifies retrieval, filtering, and management of embeddings.
LLMRails
LLMRails is an API platform for building GenAI applications. It provides an easy-to-use API for document indexing and querying that is managed by LLMRails and is optimized for performance and accuracy.
Marqo
This notebook shows how to use functionality related to the Marqo vectorstore.
Google Vertex AI MatchingEngine
This notebook shows how to use functionality related to the GCP Vertex AI MatchingEngine vector database.
Meilisearch
Meilisearch is an open-source, lightning-fast, and hyper relevant search engine. It comes with great defaults to help developers build snappy search experiences.
Milvus
Milvus is a database that stores, indexes, and manages massive embedding vectors generated by deep neural networks and other machine learning (ML) models.
MongoDB Atlas
MongoDB Atlas is a fully-managed cloud database available in AWS, Azure, and GCP. It now has support for native Vector Search on your MongoDB document data.
MyScale
MyScale is a cloud-based database optimized for AI applications and solutions, built on the open-source ClickHouse.
Neo4j Vector Index
Neo4j is an open-source graph database with integrated support for vector similarity search.
NucliaDB
You can use a local NucliaDB instance or use Nuclia Cloud.
OpenSearch
OpenSearch is a scalable, flexible, and extensible open-source software suite for search, analytics, and observability applications licensed under Apache 2.0. OpenSearch is a distributed search and analytics engine based on Apache Lucene.
Postgres Embedding
Postgres Embedding is an open-source vector similarity search for Postgres that uses Hierarchical Navigable Small Worlds (HNSW) for approximate nearest neighbor search.
PGVector
PGVector is an open-source vector similarity search for Postgres.
Pinecone
Pinecone is a vector database with broad functionality.
Qdrant
Qdrant (read: quadrant) is a vector similarity search engine. It provides a production-ready service with a convenient API to store, search, and manage points, that is, vectors with an additional payload. Qdrant is tailored to extended filtering support. This makes it useful for all sorts of neural network or semantic-based matching, faceted search, and other applications.
Redis
Redis vector database introduction and LangChain integration guide.
Rockset
Rockset is a real-time search and analytics database built for the cloud. Rockset uses a Converged Index™ with an efficient store for vector embeddings to serve low-latency, high-concurrency search queries at scale. Rockset has full support for metadata filtering and handles real-time ingestion for constantly updating, streaming data.
ScaNN
ScaNN (Scalable Nearest Neighbors) is a method for efficient vector similarity search at scale.
SingleStoreDB
SingleStoreDB is a high-performance distributed SQL database that supports deployment both in the cloud and on-premises. It provides vector storage and vector functions, including dot_product and euclidean_distance, thereby supporting AI applications that require text similarity matching.
scikit-learn
scikit-learn is an open-source collection of machine learning algorithms, including some implementations of k-nearest neighbors. SKLearnVectorStore wraps this implementation and adds the possibility to persist the vector store in JSON, BSON (binary JSON), or Apache Parquet format.
sqlite-vss
sqlite-vss is an SQLite extension designed for vector search, emphasizing local-first operations and easy integration into applications without external servers. Leveraging the Faiss library, it offers efficient similarity search and clustering capabilities.
StarRocks
StarRocks is a high-performance analytical database.
Supabase (Postgres)
Supabase is an open source Firebase alternative. Supabase is built on top of PostgreSQL, which offers strong SQL querying capabilities and enables a simple interface with already-existing tools and frameworks.
Tair
Tair is a cloud native in-memory database service developed by Alibaba Cloud.
Tencent Cloud VectorDB
Tencent Cloud VectorDB is a fully managed, self-developed, enterprise-level distributed database service designed for storing, retrieving, and analyzing multi-dimensional vector data. The database supports multiple index types and similarity calculation methods. A single index can support a vector scale of up to 1 billion and can support millions of QPS and millisecond-level query latency. Tencent Cloud Vector Database can not only provide an external knowledge base for large models to improve the accuracy of large model responses but can also be widely used in AI fields such as recommendation systems, NLP services, computer vision, and intelligent customer service.
Tigris
Tigris is an open-source serverless NoSQL database and search platform designed to simplify building high-performance vector search applications.
Timescale Vector (Postgres)
This notebook shows how to use the Postgres vector database Timescale Vector. You'll learn how to use TimescaleVector for (1) semantic search, (2) time-based vector search, (3) self-querying, and (4) how to create indexes to speed up queries.
Typesense
Typesense is an open-source, in-memory search engine that you can either self-host or run on Typesense Cloud.
USearch
USearch is a smaller and faster single-file vector search engine.
Vald
Vald is a highly scalable distributed fast approximate nearest neighbor (ANN) dense vector search engine.
vearch
Vectara
Vectara is an API platform for building GenAI applications. It provides an easy-to-use API for document indexing and querying that is managed by Vectara and is optimized for performance and accuracy.
Weaviate
Weaviate is an open-source vector database. It allows you to store data objects and vector embeddings from your favorite ML-models, and scale seamlessly into billions of data objects.
Xata
Xata is a serverless data platform, based on PostgreSQL. It provides a Python SDK for interacting with your database, and a UI for managing your data.
Zep
Zep is an open source long-term memory store for LLM applications. Zep makes it easy to add relevant documents,
Zilliz
Zilliz Cloud is a fully managed service on cloud for LF AI Milvus®,
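
Several entries in this list mention distance functions (L2 or Euclidean distance, dot product) and nearest-neighbor search. As a rough illustration of what those terms mean, here is a minimal sketch of exact (brute-force) nearest-neighbor search in plain Python. It is not the API of any store listed here; the names `l2_distance`, `dot_product`, `nearest`, and the sample `store` are hypothetical and chosen only for this example.

```python
import math


def l2_distance(a, b):
    """Euclidean (L2) distance between two equal-length vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def dot_product(a, b):
    """Dot product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))


def nearest(query, vectors, k=1):
    """Return the ids of the k stored vectors closest to `query` by L2 distance.

    `vectors` maps an id to its embedding. Every stored vector is compared
    against the query, so the cost grows linearly with the size of the store.
    """
    ranked = sorted(vectors, key=lambda vid: l2_distance(query, vectors[vid]))
    return ranked[:k]


# Hypothetical toy data: three 2-dimensional "embeddings".
store = {"a": [0.0, 0.0], "b": [1.0, 1.0], "c": [5.0, 5.0]}
closest = nearest([0.9, 1.2], store, k=2)
print(closest)
```

Approximate nearest neighbor (ANN) indexes such as HNSW, mentioned in several of the descriptions, trade some accuracy for speed by avoiding this full scan.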