
Vector Database Benchmarks - Qdrant #738

Open
1 task
irthomasthomas opened this issue Mar 16, 2024 · 1 comment
Labels
llm-benchmarks (testing and benchmarking large language models), MachineLearning (ML Models, Training and Inference), New-Label (choose this option if the existing labels are insufficient to describe the content accurately), Research (personal research notes for a topic)

Comments

@irthomasthomas (Owner)

Vector Database Benchmarks - Qdrant

DESCRIPTION:
Benchmarking Vector Databases

At Qdrant, performance is the top priority. We always make sure that system resources are used efficiently, so you get the fastest and most accurate results at the lowest cloud cost. All of our decisions, from choosing Rust to I/O optimisations, serverless support, binary quantization, and our fastembed library, follow this principle. In this article, we compare how Qdrant performs against other vector search engines.

Here are the principles we followed while designing these benchmarks:

  • We do comparative benchmarks, which means we focus on relative numbers rather than absolute numbers.
  • We use affordable hardware, so that you can reproduce the results easily.
  • We run benchmarks on the same exact machines to avoid any possible hardware bias.
  • All the benchmarks are open-sourced, so you can contribute and improve them.

Scenarios we tested

Some of our experiment design decisions are described in the F.A.Q. section. Reach out to us on our Discord channel if you want to discuss anything related to Qdrant or these benchmarks.

Single node benchmarks

We benchmarked several vector databases in various configurations on different datasets to see how the results vary. The datasets differ in vector dimensionality as well as in the distance function used. We also tried to capture the effect of different configuration parameters, both for the engine itself and for the search operation.
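The precision column reported below is typically computed by comparing each engine's approximate results against an exact brute-force search. A minimal sketch of that idea, assuming an angular (cosine) distance as in the `deep-image-96-angular` dataset; the function names are illustrative, not part of Qdrant's benchmark code:

```python
import numpy as np

def brute_force_top_k(query, vectors, k):
    """Exact top-k neighbours under cosine similarity; the ground truth."""
    q = query / np.linalg.norm(query)
    v = vectors / np.linalg.norm(vectors, axis=1, keepdims=True)
    scores = v @ q
    return set(np.argsort(-scores)[:k])

def precision_at_k(approx_ids, exact_ids, k):
    """Fraction of the exact top-k that the approximate engine returned."""
    return len(set(approx_ids) & set(exact_ids)) / k
```

In practice the ground truth is precomputed once per dataset, and each engine's responses are scored against it after the timed run.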

Updated: January 2024

| Engine | Setup | Dataset | Upload Time (m) | Upload + Index Time (m) | P95 (ms) | RPS | P99 (ms) | Latency (ms) | Precision |
|---|---|---|---|---|---|---|---|---|---|
| qdrant | qdrant-sq-rps-m-32-ef-256 | deep-image-96-angular | 4.94 | 43.81 | 64.42 | 1608.64 | 124.57 | 60.51 | 0.92 |
| elasticsearch | elasticsearch-m-32-ef-128 | deep-image-96-angular | 42.66 | 324.79 | 100.069 | 1151.34 | 115.11 | 84.58 | 0.93 |
| redis | redis-m-32-ef-512 | deep-image-96-angular | 526.56 | 526.56 | 124.26 | 834.080 | 128.35 | 115.22 | 0.92 |
| weaviate | weaviate-m-16-ef-128 | deep-image-96-angular | 70.81 | 70.81 | 545.95 | 640.37 | 922.11 | 154.72 | 0.94 |
| milvus | milvus-m-32-ef-128 | deep-image-96-angular | 4.53 | 37.45 | 212.64 | 580.72 | 237.94 | 168.82 | 0.98 |

Download raw data: here
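The throughput and latency columns above (RPS, P95, P99) can be derived from raw per-request timings. A minimal sketch of such a harness, assuming a `search_fn(query)` callable for whichever engine is under test; this is an illustration, not Qdrant's actual benchmark code:

```python
import time
import numpy as np

def run_queries(search_fn, queries):
    """Time each query; return per-request latencies (ms) and overall RPS."""
    latencies = []
    start = time.perf_counter()
    for q in queries:
        t0 = time.perf_counter()
        search_fn(q)
        latencies.append((time.perf_counter() - t0) * 1000.0)
    wall = time.perf_counter() - start
    return np.array(latencies), len(queries) / wall

def summarize(latencies):
    """Aggregate raw latencies into the metrics reported in the table."""
    return {
        "mean_ms": float(np.mean(latencies)),
        "p95_ms": float(np.percentile(latencies, 95)),
        "p99_ms": float(np.percentile(latencies, 99)),
    }
```

Note that tail percentiles (P95/P99) are computed over individual requests, while RPS is a whole-run figure, which is why an engine can rank differently on the two.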

Observations

Most of the engines have improved since our last run. Both life and software have trade-offs, but some clearly do better than others:

  • Qdrant achieves the highest RPS and lowest latencies in almost all scenarios, regardless of the precision threshold and the metric chosen. It also showed a 4x RPS gain on one of the datasets.
  • Elasticsearch has become considerably faster in many cases, but it is very slow in terms of indexing time. It can be 10x slower when storing 10M+ vectors of 96 dimensions (32 min vs 5.5 h)!
  • Milvus is the fastest at indexing and maintains good precision. However, it is not on par with the others in terms of RPS or latency once you have higher-dimensional embeddings or a larger number of vectors.
  • Redis achieves good RPS, but mostly at lower precision. It also achieved low latency with a single thread; however, its latency climbs quickly as parallel requests increase. Part of this speed gain comes from its custom protocol.
  • Weaviate has improved the least since our last run. Because of the relative improvements in the other engines, it has become one of the slowest in terms of both RPS and latency.
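The Redis observation above (low single-thread latency that degrades under concurrency) is exactly what a parallel-client measurement exposes. A minimal sketch of running the same queries from several threads, with an illustrative `search_fn` standing in for the engine client:

```python
import time
from concurrent.futures import ThreadPoolExecutor

def latency_under_parallelism(search_fn, queries, n_workers):
    """Issue queries from n_workers threads; return per-request latencies (ms)."""
    def timed(q):
        t0 = time.perf_counter()
        search_fn(q)
        return (time.perf_counter() - t0) * 1000.0

    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        return list(pool.map(timed, queries))
```

Sweeping `n_workers` (e.g. 1, 8, 64) and comparing the resulting latency distributions shows whether an engine's single-thread numbers hold up under load.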

Suggested labels

{'label-name': 'Database-Performance', 'label-description': 'Focuses on benchmarking and comparing the performance of different vector databases.', 'confidence': 51.15}

@irthomasthomas added the llm-benchmarks, MachineLearning, New-Label, and Research labels on Mar 16, 2024
@irthomasthomas (Owner, Author)

Related content

#304

Similarity score: 0.86

#625

Similarity score: 0.86

#641

Similarity score: 0.86

#456

Similarity score: 0.85

#74

Similarity score: 0.85

#386

Similarity score: 0.85
