PyTorch implementation of the models described in the paper Minimizing the Bag-of-Ngrams Difference for Non-Autoregressive Neural Machine Translation.
- Python 3.6
- PyTorch >= 0.4
- Numpy
- NLTK
- torchtext 0.2.1
- torchvision
- revtok
- multiset
- ipdb
This code is based on dl4mt-nonauto and RSI-NAT. We mainly modified model.py (lines 1107-1292).
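For orientation, below is a minimal, illustrative PyTorch sketch of a bag-of-bigrams (n=2) L1 objective. It is not the code in model.py; the function name, arguments, and the single-sentence setting are our own simplifications. It clips the expected count of each reference bigram under the position-wise independent NAT distribution by its reference count, and turns the matched mass into an L1-style distance between the two bags of bigrams.

```python
import torch
from collections import Counter

def bon_l1_bigram_loss(log_probs, ref):
    """Illustrative BoN-L1 loss for bigrams (hypothetical helper, not model.py).

    log_probs: (T, V) tensor of per-position log-probabilities for a NAT output of length T.
    ref:       list of reference token ids of length T_ref.
    """
    probs = log_probs.exp()                              # (T, V)
    T = probs.size(0)
    # Reference bag of bigrams: bigram -> count.
    ref_bigrams = Counter(zip(ref[:-1], ref[1:]))
    # Matched mass: for each reference bigram (a, b), the expected count under the
    # model is sum_t p_t(a) * p_{t+1}(b); clip it by the reference count.
    match = probs.new_zeros(())
    for (a, b), count in ref_bigrams.items():
        expected = (probs[:-1, a] * probs[1:, b]).sum()
        match = match + torch.min(expected, expected.new_tensor(float(count)))
    # L1 distance between the two bags:
    # (number of reference bigrams) + (expected number of model bigrams) - 2 * match.
    return (len(ref) - 1) + (T - 1) - 2 * match
```

A batched, GPU-efficient version of this idea is what the modified lines of model.py implement.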
The original translation corpora (IWSLT'16 En-De, WMT'16 En-Ro, WMT'14 En-De) can be downloaded from their respective sources. We recommend downloading the preprocessed corpora released in dl4mt-nonauto.
Set the correct path to the data in the data_path() function located in data.py before running the code.
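As a purely hypothetical sketch of that edit (the real data_path() signature and dataset keys may differ; the paths below are placeholders):

```python
# data.py -- hypothetical sketch; adapt to the actual data_path() in the repository.
def data_path(dataset):
    # Map each dataset name to the directory holding its preprocessed corpus.
    # Replace the placeholder paths with your local download locations.
    roots = {
        'iwslt-ende': '/path/to/iwslt16.tokenized.de-en/',
        'wmt16-enro': '/path/to/wmt16.tokenized.en-ro/',
        'wmt14-ende': '/path/to/wmt14.tokenized.en-de/',
    }
    return roots[dataset]
```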
Combine the BoN objective and the cross-entropy loss to train NAT from scratch. This process usually takes about 5 days.
$ sh joint_wmt.sh
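For intuition, the joint objective can be sketched as a weighted combination of the two losses. The snippet below reuses the bon_l1_bigram_loss sketch above on dummy data for a single sentence; alpha is an illustrative weight of our own, not a flag of joint_wmt.sh.

```python
import torch
import torch.nn.functional as F

# Dummy single-sentence example; alpha is an illustrative weight, not a script flag.
T, V, alpha = 6, 100, 0.5
logits = torch.randn(T, V, requires_grad=True)   # NAT decoder outputs for one sentence
targets = torch.randint(1, V, (T,))              # reference token ids

ce_loss = F.cross_entropy(logits, targets)
bon_loss = bon_l1_bigram_loss(F.log_softmax(logits, dim=-1), targets.tolist())
loss = alpha * bon_loss + (1.0 - alpha) * ce_loss   # joint objective
loss.backward()
```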
Take a checkpoint and train the length prediction model. This process usually takes about 1 day.
$ sh tune_wmt.sh
Decode the test set. This process usually takes about 20 seconds.
$ sh decode_wmt.sh
Alternatively, first train a NAT model using the cross-entropy loss. This process usually takes about 5 days.
$ sh mle_wmt.sh
Then, take a pre-trained checkpoint and finetune the NAT model using the BoN objective. This process usually takes about 3 hours.
$ sh bontune_wmt.sh
Take a finetuned checkpoint and train the length prediction model. This process usually takes about 1 day.
$ sh tune_wmt.sh
Decode the test set. This process usually takes about 20 seconds.
$ sh decode_wmt.sh
We also implement Reinforce-NAT (lines 1294-1390) described in the paper Retrieving Sequential Information for Non-Autoregressive Neural Machine Translation. See RSI-NAT for usage.
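For orientation, a generic REINFORCE-style update for a NAT decoder looks roughly like the sketch below. This is a textbook policy-gradient example with a placeholder reward function, not the Reinforce-NAT code in model.py; see the RSI-NAT repository for the actual implementation.

```python
import torch
import torch.nn.functional as F

def reinforce_step(logits, reward_fn):
    """Generic REINFORCE sketch (not the repository's Reinforce-NAT).

    logits:    (T, V) NAT decoder outputs for one sentence.
    reward_fn: callable mapping a list of sampled token ids to a scalar reward,
               e.g. a sentence-level BLEU/GLEU score against the reference.
    """
    log_probs = F.log_softmax(logits, dim=-1)
    dist = torch.distributions.Categorical(logits=log_probs)
    sample = dist.sample()                        # (T,) sampled translation
    reward = reward_fn(sample.tolist())           # scalar sequence-level reward
    # Policy-gradient loss: raise the log-probability of high-reward samples.
    loss = -(reward * dist.log_prob(sample).sum())
    return loss
```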
If you find the resources in this repository useful, please consider citing:
@article{Shao:19,
author = {Chenze Shao and Jinchao Zhang and Yang Feng and Fandong Meng and Jie Zhou},
title = {Minimizing the Bag-of-Ngrams Difference for Non-Autoregressive Neural Machine Translation},
year = {2019},
journal = {arXiv preprint arXiv:1911.09320},
}