
Add replication for MS MARCO passage experiment #25

Merged 5 commits into castorini:master on May 22, 2020

Conversation

@MXueguang (Member)

Replication Status:
Success (No issues)

Environment:
OS: Ubuntu 18.04 LTS
Python: 3.7.6
PyTorch: 1.5.0
GPU: Tesla P4
CUDA: 10.2

Replication Results:
monoBERT: [screenshot of evaluation output attached]
monoT5: [screenshot of evaluation output attached]
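
For context, here is a minimal sketch of the kind of scoring monoT5 performs for each query–passage pair. This is not the repo's evaluation script; it assumes a recent Hugging Face transformers install (with sentencepiece) and the castorini/monot5-base-msmarco checkpoint, and treats relevance as the probability of the word "true" at the first decoding step:

```python
# Illustrative sketch only -- the replication itself uses the repo's scripts.
import torch
from transformers import T5Tokenizer, T5ForConditionalGeneration

tokenizer = T5Tokenizer.from_pretrained('t5-base')
model = T5ForConditionalGeneration.from_pretrained('castorini/monot5-base-msmarco')
model.eval()

query = 'what is the daily life of thai people'      # hypothetical example query
passage = 'Daily life in Thailand revolves around family, food, and religion.'

# monoT5 input format: the model is fine-tuned to answer "true" or "false".
inputs = tokenizer(f'Query: {query} Document: {passage} Relevant:',
                   return_tensors='pt', truncation=True, max_length=512)

# IDs of the target words the model was fine-tuned to emit.
true_id = tokenizer('true', add_special_tokens=False).input_ids[0]
false_id = tokenizer('false', add_special_tokens=False).input_ids[0]

# Run a single decoder step and compare the logits of "true" vs. "false".
decoder_input_ids = torch.full((1, 1), model.config.decoder_start_token_id,
                               dtype=torch.long)
with torch.no_grad():
    logits = model(**inputs, decoder_input_ids=decoder_input_ids).logits[0, 0]
score = torch.softmax(logits[[false_id, true_id]], dim=0)[1].item()
print(f'relevance score: {score:.4f}')
```

Sorting a query's candidate passages by this score in descending order is what produces the reranked run behind the MRR@10 numbers above.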

@lintool (Member) commented May 22, 2020

@ronakice do we really want separate lists for both models?

How about:

Results replicated by @MXueguang on 2020-05-22 (commit 69de7db) (model 1 + model 2...)

@ronakice (Member) commented May 22, 2020

@lintool in the case that we add more models (which we probably will), I think this would be helpful. I figured that having people say (model 1 + model 2 + model 3) and occasionally just (model 2) is kind of messy, unless we have separate docs for each.

@ronakice (Member)

@MXueguang great job being the first to replicate our results! Do you happen to remember how long it took on the P4? (Not the preprocessing, just curious about the 'inference' itself; I see monoT5 took ~27 minutes!)

@MXueguang (Member, Author)

@ronakice
monoBERT took 1:00:10 (default batch size)
monoT5 took 27:36 (batch size = 96)
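
Batch size is the main knob for inference throughput on a GPU like the P4. Purely as an illustration (reusing the `tokenizer`, `model`, `true_id`, and `false_id` objects from the sketch above, with a hypothetical `score_batched` helper rather than anything in the repo), batched monoT5-style scoring looks roughly like:

```python
# Illustrative only; not the repo's script. Reuses `tokenizer`, `model`,
# `true_id`, and `false_id` from the earlier sketch.
from typing import List, Tuple
import torch

def score_batched(pairs: List[Tuple[str, str]], batch_size: int = 96) -> List[float]:
    """Score (query, passage) pairs in batches; larger batches trade GPU memory for throughput."""
    scores: List[float] = []
    for start in range(0, len(pairs), batch_size):
        batch = pairs[start:start + batch_size]
        texts = [f'Query: {q} Document: {d} Relevant:' for q, d in batch]
        enc = tokenizer(texts, return_tensors='pt', padding=True,
                        truncation=True, max_length=512)
        # One decoder step per example, starting from the decoder start token.
        dec = torch.full((len(batch), 1), model.config.decoder_start_token_id,
                         dtype=torch.long)
        with torch.no_grad():
            logits = model(**enc, decoder_input_ids=dec).logits[:, 0, :]
        # Probability of "true" at the first decoding step, per example.
        probs = torch.softmax(logits[:, [false_id, true_id]], dim=-1)[:, 1]
        scores.extend(probs.tolist())
    return scores
```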

@lintool (Member) commented May 22, 2020

> @lintool in the case that we add more models (which we probably will), I think this would be helpful. I figured that having people say (model 1 + model 2 + model 3) and occasionally just (model 2) is kind of messy, unless we have separate docs for each.

I think we should keep the number of models per page manageable. So, this being the "intro", no more models on this page. What do you think?

@ronakice (Member)

That sounds like a good idea. docs/experiments-msmarco-passage.md serves as an intro linking into, say, docs/experiments-msmarco-passage-monot5.md and docs/experiments-msmarco-passage-monobert.md.

@lintool (Member) commented May 22, 2020

> That sounds like a good idea. docs/experiments-msmarco-passage.md serves as an intro linking into, say, docs/experiments-msmarco-passage-monot5.md and docs/experiments-msmarco-passage-monobert.md.

But I think two models on the same page is reasonable... so keep the page content as is (but fold the two replication sections into one); if we want to add another model, we create a new page?

@ronakice (Member)

Sounds like a plan. Yes, I was thinking that might be a good idea too, since these are both similar enough pointwise estimators. Do we need to track the GPUs this has worked on? We can assume that whoever replicates monoBERT can also replicate monoT5-base, since it is quite a bit faster, and ignore the (model 1 + model 2) part too. Something like:

Results replicated by @MXueguang on 2020-05-22 (commit 69de7db) (Tesla P4)

@lintool (Member) commented May 22, 2020

> Results replicated by @MXueguang on 2020-05-22 (commit 69de7db) (Tesla P4)

👍 on this. We can always track back to the PR for more details.

@lintool (Member) commented May 22, 2020

@MXueguang please add the GPU, and @ronakice can merge.

@ronakice (Member)

@MXueguang you can also remove the (monoBERT + monoT5) part. LGTM other than that!

@ronakice (Member)

Awesome, great job @MXueguang

@ronakice merged commit 6e9dfc6 into castorini:master on May 22, 2020