Vector Database Benchmarks - Qdrant #738
Labels
llm-benchmarks
testing and benchmarking large language models
MachineLearning
ML Models, Training and Inference
New-Label
Choose this option if the existing labels are insufficient to describe the content accurately
Research
personal research notes for a topic
Vector Database Benchmarks - Qdrant
DESCRIPTION:
Benchmarking Vector Databases
At Qdrant, performance is the top-most priority. We always make sure that we use system resources efficiently so you get the fastest and most accurate results at the cheapest cloud costs. So all of our decisions from choosing Rust, io optimisations, serverless support, binary quantization, to our fastembed library are all based on our principle. In this article, we will compare how Qdrant performs against the other vector search engines.
Here are the principles we followed while designing these benchmarks:
Scenarios we tested
Some of our experiment design decisions are described in the F.A.Q Section. Reach out to us on our Discord channel if you want to discuss anything related Qdrant or these benchmarks.
Single node benchmarks
We benchmarked several vector databases using various configurations of them on different datasets to check how the results may vary. Those datasets may have different vector dimensionality but also vary in terms of the distance function being used. We also tried to capture the difference we can expect while using some different configuration parameters, for both the engine itself and the search operation separately.
Updated: January 2024
Download raw data: here
Observations
Most of the engines have improved since our last run. Both life and software have trade-offs but some clearly do better:
Suggested labels
{'label-name': 'Database-Performance', 'label-description': 'Focuses on benchmarking and comparing the performance of different vector databases.', 'confidence': 51.15}
The text was updated successfully, but these errors were encountered: