wmt

History

Name		Name	Last commit message	Last commit date
parent directory ..
config		config
README.md		README.md
eval_wmt_ende.sh		eval_wmt_ende.sh
prepare_data.sh		prepare_data.sh
run_wmt_ende.sh		run_wmt_ende.sh

README.md

Steps to train a Transformer model on the WMT English-German dataset

Requirements

tensorflow (1.5)
pyyaml
sentencepiece

Please follow the instructions to install and build SentencePiece. Once it's installed, do not forget to change the SP_PATH variable in scripts.

Data preparation

Before running the script, look at the links to download the datasets. Depending on the task, you may change the filenames and the folders paths.

cd scripts/wmt
./prepare_data.sh

The script will train a SentencePiece model using the same source and target vocabulary. It will tokenize the dataset and prepare the train/valid/test files.

Training

cd scripts/wmt
./run_wmt_ende.sh

By default (to be modified in wmt-ende.yml) training will be done on 4 GPUs and during 200,000 steps.

Translate

cd scripts/wmt
./eval_wmt_ende.sh

Lazy run...

Pre-tokenized SentencePiece dataset
Pre-trained averaged model:
- checkpoint
- export

This model achieved the following scores:

Test set	NIST BLEU
newstest2014	26.9
neswtest2017	28.0

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Files

wmt

wmt

README.md

Steps to train a Transformer model on the WMT English-German dataset

Requirements

Data preparation

Training

Translate

Lazy run...

Files

wmt

Directory actions

More options

Directory actions

More options

Latest commit

History

wmt

Folders and files

parent directory

README.md

Steps to train a Transformer model on the WMT English-German dataset

Requirements

Data preparation

Training

Translate

Lazy run...