PyTorch implementation of 'A Neural Compositional Paradigm for Image Captioning' by B. Dai, S. Fidler, D. Lin, published at NeurIPS 2018. [Link]
- ResNet-152 for NP extraction
- Attention unit
- Two Level LSTM
- Connecting Module
- Evaluation Module
TODO: Implement the data preprocessing and preparation as done in the paper