-
Notifications
You must be signed in to change notification settings - Fork 2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Discussion] bm25 semantic search / QueryExpander #573
Comments
Hey @flozi00 , Yes, this sounds interesting and I could see an integration via creating a new "QueryExpander" class.
Do you really mean DPR here? I thought you wanted to pass your expanded query to ES to get the "speed of bm25"?
Can you give an example of this step? I thought you would filter out keywords here, but later you say "+ keywords (3 times)", so I guess I am not understanding your step here ... |
Yeah, we let DPR rerank the results, but it is not mandatory.
Raw query: Please tell me whats the weather in berlin
|
Ok got it. I totally see the value of such a new
Would you be interested in raising a PR? We would then take care of integrating it into the new, upcoming |
Yeah, I can do so. |
Great! Very much appreciated 👍 |
This issue has been automatically marked as stale because it has not had recent activity. It will be closed in 21 days if no further activity occurs. |
Dear @tholor, we just arrived here via [1] and would like to salute you and the other authors of this framework for their efforts. In the context of what @flozi00 was asking for above, like »Raw query: Please tell me what's the weather in Berlin«, we would like to share our little experimental project [2] with you. Andrew Wigmore (@visualcrossing) might be interested in this topic as well [3]. In this manner, I am humbly asking you to reopen this issue to keep it as a note where people are actively interested in, or, maybe, just move it to the "Discussions" section? I believe it could fit better there. Keep up the spirit and with kind regards, [1] https://news.ycombinator.com/item?id=29501045 |
In an internal system we did not passed the raw question to elastic search.
We wanted the speed of bm25 but the intuitivity of vectorized search too, so we some manipulations to the elastic search Input.
After that we passed the data to DPR and those results to the QA model.
Would such an integration make sense for this project ?
The text was updated successfully, but these errors were encountered: