
[META] Add performance and accuracy benchmarks for Neural search Features #430

Open
navneet1v opened this issue Oct 10, 2023 · 5 comments
Labels
backlog: All the backlog features should be marked with this label

Comments

@navneet1v
Collaborator

navneet1v commented Oct 10, 2023

Description

The aim of this issue is to add performance and accuracy benchmarks for the different features of the Neural Search plugin.

Tasks

  • List all the datasets that will be used in the benchmarking
  • Performance Benchmarks for ingestion with different processors
    • Text Embedding Processor
    • Sparse Encoding Processor
    • Text and Image Embedding Processor
  • Performance Benchmarks for different Query Clauses
    • Neural Query Clause
    • Sparse Encoding Query Clause
    • Hybrid Query Clause
    • Hybrid Query using Bool Query Clause
  • Accuracy Benchmarks (see the metric sketch after this list)
    • Neural Query Clause
    • Sparse Encoding Query Clause
    • Hybrid Query Clause
    • Hybrid Query using Bool Query Clause
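
For the accuracy tasks above, a minimal sketch of the metric side is shown below. It assumes a BEIR-style qrels mapping and a hypothetical `run_query` callable that wraps whichever query clause is under test; only the recall@k logic itself is the point here.

```python
# Minimal accuracy-benchmark sketch. `run_query`, the dataset layout, and k are
# assumptions for illustration; only the metric computation is the point.

def recall_at_k(retrieved_ids, relevant_ids, k):
    """Fraction of relevant documents that appear in the top-k retrieved results."""
    if not relevant_ids:
        return 0.0
    return len(set(retrieved_ids[:k]) & relevant_ids) / len(relevant_ids)

def evaluate(run_query, queries, qrels, k=10):
    """queries: {query_id: query_text}; qrels: {query_id: {doc_id: relevance}}.
    run_query(query_text, k) is a hypothetical callable returning an ordered
    list of document ids from a neural, neural_sparse, hybrid, or bool search."""
    scores = []
    for query_id, query_text in queries.items():
        retrieved = run_query(query_text, k)
        relevant = {doc_id for doc_id, rel in qrels.get(query_id, {}).items() if rel > 0}
        scores.append(recall_at_k(retrieved, relevant, k))
    return sum(scores) / len(scores) if scores else 0.0
```

Because only `run_query` changes between runs, the same harness can score each of the query clauses listed above on the same dataset.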
navneet1v added the backlog label and removed the untriaged label Oct 10, 2023
navneet1v moved this from Backlog to Backlog (Hot) in Vector Search RoadMap Oct 10, 2023
navneet1v changed the title from "Add performance and accuracy benchmarks for Neural search Features" to "[META] Add performance and accuracy benchmarks for Neural search Features" Oct 10, 2023
@jmazanec15
Member

I'm wondering if, as part of this, we should add search relevance metrics/workloads to OSB? For instance, for the text-based queries, one key question this will answer is when to use which approach and what the tradeoffs are. We could have a generic OSB run where the input/output stays constant (like the BEIR datasets) and we just change the internal implementation. When a new method comes in (e.g., a reranker, or different combination logic such as RRF), we can just plug it into the OSB configuration, run the test, and determine where it stacks up.
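
As a rough illustration of that "fixed input/output, swap the internals" idea (not part of the original comment), the builders below show how the clauses discussed in this issue could be interchanged for a constant query text. The field names (`passage_text`, `passage_embedding`, `passage_sparse`) and model ids are placeholders, and the hybrid clause assumes a search pipeline with a normalization-processor is configured separately.

```python
# Query-body builders so that only the clause under test changes between runs.
# Field names and model ids are placeholders, not defined anywhere in this issue.

def neural_body(query_text, k=10):
    return {"query": {"neural": {"passage_embedding": {
        "query_text": query_text, "model_id": "<dense-model-id>", "k": k}}}}

def neural_sparse_body(query_text):
    return {"query": {"neural_sparse": {"passage_sparse": {
        "query_text": query_text, "model_id": "<sparse-model-id>"}}}}

def hybrid_body(query_text, k=10):
    # Assumes a search pipeline with a normalization-processor combines the scores.
    return {"query": {"hybrid": {"queries": [
        {"match": {"passage_text": {"query": query_text}}},
        {"neural": {"passage_embedding": {
            "query_text": query_text, "model_id": "<dense-model-id>", "k": k}}}]}}}

def bool_body(query_text, k=10):
    # "Hybrid via bool": the same sub-queries combined by a plain bool/should clause.
    return {"query": {"bool": {"should": [
        {"match": {"passage_text": {"query": query_text}}},
        {"neural": {"passage_embedding": {
            "query_text": query_text, "model_id": "<dense-model-id>", "k": k}}}]}}}
```

A new method (a reranker, RRF-style combination, and so on) would then just be one more builder plugged into the same benchmark run.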

@navneet1v
Collaborator Author

@jmazanec15 the idea is for this to be a high-level issue for adding the benchmarks. What should be used to run the benchmarks, OSB or something else, has not been decided yet, and I have left it open. If we start using OSB then yes, we need to get search relevance metrics into OSB, but we should work with the OSB team to provide a capability to capture these custom metrics.

@sam-herman

> @jmazanec15 the idea is for this to be a high-level issue for adding the benchmarks. What should be used to run the benchmarks, OSB or something else, has not been decided yet, and I have left it open. If we start using OSB then yes, we need to get search relevance metrics into OSB, but we should work with the OSB team to provide a capability to capture these custom metrics.

+1. I think the first priority is to come up with benchmarks that help provide a baseline for search quality.
Regarding OSB as an implementation platform, I'm not so sure. It is implemented in Ruby and focuses on stress testing, while we are trying to define quality metrics. For that, even small datasets can do just fine, and we could run them as part of IT tests; an embedded JMH framework would seem like a more native solution to the task.

@jmazanec15
Member

> +1. I think the first priority is to come up with benchmarks that help provide a baseline for search quality.

Yes, definitely agree with this.

> It is implemented in Ruby and focuses on stress testing, while we are trying to define quality metrics.

OSB is actually in Python, so it should be more friendly with existing datasets.

> For that, even small datasets can do just fine, and we could run them as part of IT tests; an embedded JMH framework would seem like a more native solution to the task.

That's interesting. I'm not super familiar with it, but it could make sense; it'd be nice to have as an integ test. I guess I like OSB because it would (1) be easier to integrate into automated performance testing infrastructure and metric publishing, and (2) let users test relevance on their own clusters more easily (i.e., just point the OSB workload or a custom workload at their cluster and let it run). But maybe it makes sense to do both.

@navneet1v
Collaborator Author

A PR has been added for the text_embedding benchmarks: https://github.com/opensearch-project/opensearch-benchmark-workloads/pull/232/files
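
For readers new to that workload, the ingestion path a text_embedding benchmark exercises looks roughly like the sketch below. The pipeline name, index name, field map, vector dimension, and model id are placeholders, the opensearch-py client is assumed, and the workload in the PR above remains the authoritative version.

```python
# Sketch of the text_embedding ingestion path: an ingest pipeline with the
# text_embedding processor set as the index default_pipeline, then a timed bulk
# load. All names, the model id, and the dimension are placeholders.
import time
from opensearchpy import OpenSearch, helpers

client = OpenSearch(hosts=[{"host": "localhost", "port": 9200}])

client.ingest.put_pipeline(id="nlp-ingest-pipeline", body={
    "processors": [{"text_embedding": {
        "model_id": "<dense-model-id>",
        "field_map": {"passage_text": "passage_embedding"}}}]})

client.indices.create(index="benchmark-index", body={
    "settings": {"index": {"knn": True, "default_pipeline": "nlp-ingest-pipeline"}},
    "mappings": {"properties": {
        "passage_text": {"type": "text"},
        "passage_embedding": {"type": "knn_vector", "dimension": 768}}}})

def timed_bulk(docs):
    """Return the seconds taken to bulk-index docs through the embedding pipeline."""
    start = time.monotonic()
    helpers.bulk(client, ({"_index": "benchmark-index", "_source": doc} for doc in docs))
    return time.monotonic() - start
```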
