Open Source | License | GitHub stars | Developed in (language) | Summary | Business Model | Embeds / Uses | founding date / first released date | In-memory Unterstützung | Sharding | Index Types | Consistency Model | Benchmarks (Performance?) | Queries per second (using text nytimes-256-angular) | Latency, ms (Recall/Percentile 95 (millis), nytimes-256-angular) | Approximate Nearest Neighbor (ANN) Vector Databases | Funding | Who's behind it | HQ in | |||
Marqo AI | Y | Apache-2.0 | 2.8k ⭐ | Python | A tensor-based cloud-native commercial Open Source search and analytics engine. | Open SaaS | Tensor-based | ❔ | Y | HNSW | - | ❔ | ❔ | undisclosed preseed in May 2022 | S2Search Australia Pty Ltd | 🇦🇺 | |||||
Weaviate | Y | BSD | 5.6k ⭐ | Assembly, C++, GoLang | Weaviate is a commercial Open Source cloud-native vector database that stores both objects and vectors. | Open SaaS | ❔ | started in 2018 as a traditional graph database, first released in 2019 | N | Y, static sharding | a custom HNSW PQ algorithm that supports CRUD | Eventual Consistency | not comparative, just evaluating their own performance | 791 | 2 | Y (multiple ANN algorithms as long as they support full CRUD) | 67.7M USD, series B | SeMI Technologies | 🇪🇺 | ||
Chroma | Y | Apache-2.0 | 4.4k ⭐ | Python & Typescript | Chroma is a Commercial Open Source vector database | Preparing a (Partly Open) SaaS model* [Commercial Open Source] | HNSW lib, DuckDB; based on ClickHouse | looks like 2022 | N | Dynamic segment placement | ❔ | ❔ | Y | 20.3M USD, seed | Chroma Inc. | 🇺🇸 | |||||
Qdrant | Y | Apache-2.0 | 6.6k ⭐ | Rust | Qdrant is a Commercial Open Source vector similarity search engine and vector database | Open SaaS | RocksDB | first released: 2021 | Y | Y, static sharding | HNSW (SQ & PQ) | Eventual Consistency, tunable consistency | compares to weaviate, milvus, elastic (note: redis took too long to complete) | 326 | 4 | Y | 9.8M € | Qdrant Solutions GmbH | 🇪🇺 | ||
Milvus | Y | Apache-2.0 | 18k ⭐ | GoLang & Python | Milvus is a cloud-native Commercial Open Source vector database | (Partly Open) SaaS* [Commercial Open Source] | Initial blog post from them said SQLite, but meanwhile they said
RocksDB - exchanged? they also have a ChatGPT-Cache that is build on SQLite and say "Milvus uses SQLite or MySQL to manage metadata" |
founded 2017, first released: 2019 | N | Dynamic segment placement | ANNOY; HNSW; IVF_PQ; IVF_SQ(; IVF_FLAT; FLAT; IVF_SQ8_H; RNSG | Strong, bounded staleness, session, and eventually. The default consistency level in Milvus is bounded staleness. | not comparative | 2406 | 1 | Y | 113M USD, series B | Zilliz | 🇺🇸 | ||
Vespa | Y | Apache-2.0 | 4.4k ⭐ | Java & C++ | Vespa is a Commercial Open Source vector database by Yahoo! It is a search engine which supports vector search, lexical search, and search in structured data | Open SaaS | ❔ | Originally a web search engine (alltheweb), acquired by Yahoo! in 2003 and later open sourced as Vespa in 2017; sinde Oct 2023 spinoff, raised series A in Nov 2023 | maintains disk and memory structures for documents | Y | Custom HNSW (Multi-vector hybrid HNSW-IF) | Eventual Consistency | not comparative | ❔ | ❔ | Y | Spinoff from Yahoo! in Oct 2023, then raised a 31M USD series A | Yahoo! | 🇺🇸 | ||
Vald | Y | Apache-2.0 | 1.2k ⭐ | GoLang | Vald is a cloud-native Open Source distributed approximate nearest neighbor (ANN) dense vector search engine | Community project, currently looks like no commercial interests are pursued | uses the vector search engine NGT | Technology incubation at Yahoo! Japan Corporation, development was stared in 2019 | ❔ | N/A | N/A | N/A | not comparitive, but Vald performance only | ❔ | ❔ | Y (NGT) | - | Yusuke Kato (Yahoo Japan Corporation), Kiichiro Yukawa (Yahoo Japan Corporation) | 🇯🇵 | ||
Pinecone |
N | Proprietary | NA | Pinecone is a fully managed vector database that specializes in enabling semantic search capabilities | SaaS | built on top of Faiss | first released in 2019 | N | Y | proprietary | Eventual Consistency | more programming language comparison for vector databases | 150 (for p2, but more pods can be added) | 1 (batched search, 0.99 recall, 200k SBERT) | Y (proprietary), plus KNN (with Faiss) | 138M, series B | Pinecone Systems Inc | 🇺🇸 | |||