-
Use Amazon Deep Learning AMI (Ubuntu) Version 15.0 - ami-0b43cec40e1390f34
-
Configure Drive for 100GB of space + SSD
-
Install pytorch 0.4.1 w/ CUDA:
conda install pytorch torchvision cuda92 -c pytorch
Clone git repo:
git clone https://github.com/hitvoice/DrQA.git
Follow readme:
cd DrQA
pip install -r requirements.txt
bash download.sh
python prepro.py
python train.py -e 40 -bs 128 --save_last_only --save_dawn_logs
Technically, the p3 can support larger batch sizes, but convergence decreases, so use this size.
Then, use the following to generate the .tsv entry for Dawnbench:
grep dawn_entry models/log.txt | cut -c 33- > models/dawn_entry.tsv