[META] Improve performance for exact search with script scoring #1709

jmazanec15 · 2024-05-20T21:17:48Z

Description

Meta-issue for improving performance of exact search with script scoring.

I ran several experiments showcasing the performance of exact scoring on a single node. In addition, I captured several profiling examples.

The testing code can be found in https://github.com/jmazanec15/opensearch-knn-rescore-experiments/. The code that was benchmarked can be found in https://github.com/jmazanec15/k-NN-1/tree/exact-scoring-exps. The Cohere dataset (1M 768-dimensional vectors, 10k queries) was used with the innerproduct space type.
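For readers unfamiliar with the term, exact search here means scoring every stored vector against the query and keeping the best k, with no approximate index involved. The following is a minimal sketch of that idea for the inner product space, not code from the plugin or from the benchmark repositories. The class name `ExactSearchSketch` and its methods are hypothetical and exist only for illustration.

```java
import java.util.Arrays;
import java.util.PriorityQueue;

// Hypothetical illustration of brute-force (exact) top-k search; not plugin code.
final class ExactSearchSketch {

    // Scalar inner product of two equal-length vectors.
    static float innerProduct(float[] a, float[] b) {
        float sum = 0f;
        for (int i = 0; i < a.length; i++) {
            sum += a[i] * b[i];
        }
        return sum;
    }

    // Scores every vector against the query and returns the ids of the
    // k highest-scoring vectors, best first.
    static int[] topK(float[][] vectors, float[] query, int k) {
        float[] scores = new float[vectors.length];
        // Min-heap on score, so the weakest of the current top k is evicted first.
        PriorityQueue<Integer> heap =
            new PriorityQueue<>((x, y) -> Float.compare(scores[x], scores[y]));
        for (int id = 0; id < vectors.length; id++) {
            scores[id] = innerProduct(vectors[id], query);
            heap.add(id);
            if (heap.size() > k) {
                heap.poll();
            }
        }
        // The heap yields ids in ascending score order, so fill the result from the back.
        int[] result = new int[heap.size()];
        for (int i = result.length - 1; i >= 0; i--) {
            result[i] = heap.poll();
        }
        return result;
    }

    public static void main(String[] args) {
        float[][] vectors = { {1f, 0f}, {0f, 3f}, {2f, 2f} };
        float[] query = {1f, 1f};
        System.out.println(Arrays.toString(topK(vectors, query, 2)));
    }
}
```

Because every vector is scored, the cost per query grows linearly with both the number of vectors and their dimension, which is why the per-vector read and scoring paths dominate the latencies reported in this issue.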

For 1M 768-dimensional vectors, the fastest that exact search achieves is a p50/p90/p99 latency of 102 ms/119 ms/124 ms, with Lucene-backed storage and SIMD enabled.
There is a roughly 2x difference in performance between the Lucene and plugin formats. When all vectors fit in memory, the p50/p90/p99 latency is 323 ms/325 ms/344 ms (no SIMD) for Lucene-backed storage and 568 ms/584 ms/594 ms (no SIMD) for plugin-backed storage. This indicates that Lucene's vector format is almost 2x faster than the plugin's for script scoring. The cause appears to be that Lucene can map float vectors directly into the JVM via Panama, whereas the plugin has to copy the bytes in and then deserialize them. This overlaps with [Enhancement] Optimize the de-serialization of vector when reading from Doc Values #1050.
SIMD gave a roughly 3x improvement over non-SIMD. Without SIMD, for Lucene-backed storage, the p50/p90/p99 latency is 323 ms/325 ms/344 ms. With SIMD, it is 102 ms/119 ms/124 ms.
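To make the difference between the two read paths concrete, here is a rough sketch contrasting "copy the bytes, then deserialize into a new float[]" with "read floats through a view over the existing bytes". It is only an analogy: the class `VectorReadSketch`, its method names, and the little-endian layout are assumptions for illustration, a `java.nio` `FloatBuffer` view stands in for Lucene's Panama-based mapping, and neither method reflects the actual plugin or Lucene code.

```java
import java.nio.ByteBuffer;
import java.nio.ByteOrder;
import java.nio.FloatBuffer;

// Hypothetical illustration of two ways to read a stored vector; not plugin or Lucene code.
final class VectorReadSketch {

    // Assumed storage layout for this sketch: little-endian 32-bit floats.
    static byte[] encode(float[] vector) {
        ByteBuffer buf = ByteBuffer.allocate(vector.length * Float.BYTES)
            .order(ByteOrder.LITTLE_ENDIAN);
        buf.asFloatBuffer().put(vector);
        return buf.array();
    }

    // Copy-and-deserialize path: materialize a fresh float[] per document, then score it.
    static float scoreWithCopy(byte[] stored, float[] query) {
        float[] vector = new float[stored.length / Float.BYTES];
        ByteBuffer.wrap(stored).order(ByteOrder.LITTLE_ENDIAN).asFloatBuffer().get(vector);
        float sum = 0f;
        for (int i = 0; i < query.length; i++) {
            sum += vector[i] * query[i];
        }
        return sum;
    }

    // View path: read floats straight from the backing bytes, with no intermediate array.
    static float scoreWithView(ByteBuffer stored, float[] query) {
        FloatBuffer view = stored.duplicate().order(ByteOrder.LITTLE_ENDIAN).asFloatBuffer();
        float sum = 0f;
        for (int i = 0; i < query.length; i++) {
            sum += view.get(i) * query[i];
        }
        return sum;
    }

    public static void main(String[] args) {
        byte[] stored = encode(new float[] {1f, 2f, 3f});
        float[] query = {4f, 5f, 6f};
        System.out.println(scoreWithCopy(stored, query));
        System.out.println(scoreWithView(ByteBuffer.wrap(stored), query));
    }
}
```

Both methods return the same score. The difference is the per-document allocation and copy in the first path, which is the kind of overhead the comparison above attributes to the plugin format.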

The following configurations were used to execute these tests:

Run #	p50 latency (ms)	p90 latency (ms)	p99 latency (ms)	Recall
1	324	326	344	0.99998
2	324	325	328	0.99998

Run #	p50 latency (ms)	p90 latency (ms)	p99 latency (ms)	Recall
1	102	119	124	0.999999
2	103	119	125	0.999999

Run #	p50 latency (ms)	p90 latency (ms)	p99 latency (ms)	Recall
1	674	684	692	0.99998
2	674	684	692	0.99998

Run #	p50 latency (ms)	p90 latency (ms)	p99 latency (ms)	Recall
1	568	584	594	0.99998
2	568	584	596	0.99998

jmazanec15 mentioned this issue May 20, 2024

Support script score when doc value is disabled and fix misusing DISI #1696

Merged

jmazanec15 mentioned this issue Jun 10, 2024

[k-NN] Avoid additional copy to stream during binary doc values deserialization #1736

Closed