Add a few more things
1 Refactor the code a little bit.
2 Add BPE (didn’t seem to work much different)
3 Add nucleus sampling, topk and gumbel softmax sampling.
4 Make AttEnsemble compatible with transformer
5 Add remove bad ending from Improving Reinforcement Learning Based Image Captioning with Natural Language Prior