-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
reproducing your results on MS MARCO #3
Comments
Hello, Please take a look at the coCondenser fine-tuning tutorial. It should answer most of your questions. We can leave this issue open for now in case you run into other problems. |
Thank you for the great tutorial ! |
Thanks for catching that! |
Hi, I was able to replicate the MRR@10 that you reported in the paper ( 0.38) but I was wondering what is the difference between the number that is reported on the leaderboard ( 0.44) vs 0.38? |
Hi, @luyug Thanks for your awesome work. Or if it need some time, could you tell me whether your SOTA model on NQ is trained with mined hard negatives or with both BM hard negatives and mined hard negatives as DPR github? Thanks. |
Hi @luyug, Thanks for your great work! I also have the confusion about the difference between the reported result and leaderboard (0.38 vs. 0.44). Is there any update on this? |
Also interested, from what I remember the main difference is that there's also a reranker applied, would it be possible to get the checkpoint of the reranker? |
Hi, First, I run Fine-tuning Stage 1 with
, and get MRR@10=0.3596, R@1000=0.9771. (Your reported results are MRR@10=0.357, R@1000=0.978). Then, I run the hard negative mining with random sampling 30 negatives from the top-200 retrieval results of Second, I run Fine-tuning Stage 2 with
, and get MRR@10=0.3657, R@1000=0.9761. (Your reported results are MRR@10=0.382, R@1000=0.984). There are several possible issues that I would like to confirm:
Thank you in advance! |
Hi,
Thank you for your great work!
I am willing to replicate your results on MS MARCO passage collection and I have a question regarding
Luyu/co-condenser-marco
model. Is this the final model that you used to retrieve documents? Or do I need to train it on MS MARCO relevant query/passage pairs?Is it possible to provide a little bit more detail on how should I use your dense toolkit with this model?
Thank you in advance!
The text was updated successfully, but these errors were encountered: