Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
searchivarius authored Dec 14, 2020
1 parent 8f2f682 commit b117706
Showing 1 changed file with 3 additions and 2 deletions.
5 changes: 3 additions & 2 deletions scripts/data_convert/msmarco/README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
# MSMARCO
1. [Document ranking](DOCS.md)
2. [Passage ranking](PASSAGES.md)
3. A [detailed notebook](MSMARCO_docs_2020-10-12_complete.ipynb) to
reproduce our best traditional runs submitted
3. Step-by-step notebooks to reproduce our run submitted
[to the MS MARCO leaderboard in December 2020](https://microsoft.github.io/msmarco/#docranking).
* [One notebook](MSMARCO_docs_2020-12-06_complete.ipynb) reproduces all steps necessary to download the data, preprocess it, and train all the models.
* [The second notebook](MSMARCO_docs_2020-12-06_processed_data_and_precomp_model1.ipynb) operates on preprocessed data in FlexNeuART JSONL format. It does not require running GIZA to generate IBM Model 1 (these models are already trained).

0 comments on commit b117706

Please sign in to comment.