This repository contains the source code for TAP introduced in the following papers:
- v1: A gru-based encoder-decoder approach with attention for online handwritten mathematical expression recognition
- v2: Track, attend and parse (TAP): An end-to-end framework for online handwritten mathematical expression recognition(Under reviewed by Transactions on Multimedia)
Here, v1 employs the coverage based spatial attention model, v2 employs the guided hybrid attention model.
- Install cuda-8.0 cudnn-v7
- Install Theano.0.10.0 with libgpuarray
If you find TAP useful in your research, please consider citing:
@inproceedings{zhang2017gru,
title={A GRU-based Encoder-Decoder Approach with Attention for Online Handwritten Mathematical Expression Recognition},
author={Zhang, Jianshu and Du, Jun and Dai, Lirong},
booktitle={Document Analysis and Recognition (ICDAR), 2017 14th International Conference on},
year={2017},
organization={IEEE}
}
-
Train TAP without using weightnoise and save the best model in terms of WER
$ bash train.sh
-
Anneal the best model by using weightnoise and save the new best model
$ bash train_weightnoise.sh
-
Reload the new best model and generate the testing latex strings
$ bash test.sh
xysszjs at mail.ustc.edu.cn
West campus of University of Science and Technology of China
Any discussions, suggestions and questions are welcome!