OpenSearch top k parameter fix #5216

dev2049 · 2023-05-24T22:36:19Z

For most queries it's the `size` parameter that determines the final number of documents to return. Since our abstractions refer to this as `k`, set this to be `k` everywhere instead of expecting a separate param. Would be great to have someone more familiar with OpenSearch validate that this is reasonable (e.g. that having `size` and what OpenSearch calls `k` be the same won't lead to any strange behavior). cc @naveentatikonda

Closes #5212

naveentatikonda · 2023-05-24T22:44:10Z

@dev2049 size and k are different parameters. k is the number of neighbors the search of each graph will return. You must also include the size option, which indicates how many results the query actually returns.

dev2049 · 2023-05-24T22:53:52Z

> @dev2049 size and k are different parameters. k is the number of neighbors the search of each graph will return. You must also include the size option, which indicates how many results the query actually returns.

Right, but in LangChain `k` refers to the total number of documents to return. Happy to add an optional `knn_top_k` parameter as well and pass that in.

naveentatikonda · 2023-05-24T23:03:57Z

> > @dev2049 size and k are different parameters. k is the number of neighbors the search of each graph will return. You must also include the size option, which indicates how many results the query actually returns.
>
> right, but in LangChain k refers to total num docs to return. happy to add an optional knn_top_k parameter as well and pass that in

People might get confused if we introduce a new parameter. I guess that should be fine; let's go with what you have already implemented in this PR.
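For illustration, here is a minimal sketch of the approach settled on in this thread: a single `k` value is used both for the top-level `size` and for the `k` inside the `knn` clause of an approximate k-NN query body. The helper name `build_knn_query`, the default field name `vector_field`, and the default `k=4` are assumptions made for this example and are not taken from the PR diff.

```python
from typing import Any, Dict, List


def build_knn_query(
    query_vector: List[float],
    k: int = 4,  # assumed default, for illustration only
    vector_field: str = "vector_field",  # hypothetical field name
) -> Dict[str, Any]:
    """Build an approximate k-NN search body where `size` and `k` share one value."""
    return {
        # `size`: how many results the query actually returns.
        "size": k,
        "query": {
            "knn": {
                vector_field: {
                    "vector": query_vector,
                    # `k`: number of neighbors the search of each graph returns.
                    "k": k,
                }
            }
        },
    }


# Example: request 10 documents; both parameters follow the same value.
body = build_knn_query([0.1, 0.2, 0.3], k=10)
```

Tying the two values together keeps the caller-facing meaning of `k` as "total documents returned", at the cost of not being able to tune the per-graph neighbor count separately.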


fix

cfcadca

dev2049 requested a review from hwchase17 May 24, 2023 22:36

dev2049 merged commit 3be9ba1 into master May 25, 2023

dev2049 deleted the 5212-opensearch-vectorstore-cannot-return-more-than-4-retrieved-result branch May 25, 2023 16:51

danielchalef mentioned this pull request Jun 5, 2023

Zep Hybrid Search #5742

Merged

This was referenced Jun 25, 2023

Zep Authentication #6725

Closed

Zep Authentication #6728

Merged
