Using Attention to improve performance of OCR
- Check sample directory for data format structure
- Download the inceptionv3 or inception_resnet_v2 weights:
inceptionv3 : 'https://download.pytorch.org/models/inception_v3_google-1a9a5a14.pth'
inception_resnet_v2 : 'http://data.lip6.fr/cadene/pretrainedmodels/inceptionresnetv2-520b38e4.pth'
- Configure train config file
- Run train.py after configuring the train config file
- For inference, configure the config_infer file
- Run inference.py
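
The weight download step above can be scripted. The following is a minimal sketch using only the Python standard library; the `weights` directory and the helper names `weight_path` and `download_weights` are assumptions for illustration and are not part of this repository, so point the train config at whatever path you actually use.

```python
import os
import urllib.request
from urllib.parse import urlparse

# URLs copied from the list above.
WEIGHT_URLS = {
    "inceptionv3": "https://download.pytorch.org/models/inception_v3_google-1a9a5a14.pth",
    "inception_resnet_v2": "http://data.lip6.fr/cadene/pretrainedmodels/inceptionresnetv2-520b38e4.pth",
}


def weight_path(backbone, dest_dir="weights"):
    """Return the local file path for a backbone's weights (hypothetical layout)."""
    if backbone not in WEIGHT_URLS:
        raise ValueError(f"unknown backbone: {backbone!r}")
    # Reuse the file name from the URL so the checkpoint hash stays visible.
    filename = os.path.basename(urlparse(WEIGHT_URLS[backbone]).path)
    return os.path.join(dest_dir, filename)


def download_weights(backbone, dest_dir="weights"):
    """Download the weights if they are not already on disk; return the path."""
    path = weight_path(backbone, dest_dir)
    if not os.path.exists(path):
        os.makedirs(dest_dir, exist_ok=True)
        urllib.request.urlretrieve(WEIGHT_URLS[backbone], path)
    return path
```

Calling `download_weights("inceptionv3")` fetches the file once and returns its local path; later calls skip the download because the file already exists.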
This work is based on the paper
Attention-based Extraction of Structured Information from Street View Imagery
Zbigniew Wojna∗, Alex Gorban†, Dar-Shyang Lee†, Kevin Murphy†, Qian Yu†, Yeqing Li†, Julian Ibarz†
∗ University College London † Google Inc.