Test LSTM OCR Engine in Tesseract #30
Comments
I've just updated the Tesseract version on our dev environment to 4.0.0alpha and reran the problematic test (the one with a rather poor-quality document, where the tables are not aligned, etc.), and the result is significantly better than with the older version.
3.04.01:
4.0.0alpha:
However, that does not mean 4.0.0 only makes tiny mistakes like two -> twn, but there are significantly fewer of them, and it is possible to understand the meaning of a sentence regardless of them.
Yeah, I was keen to see how much better the LSTM OCR engine is. Granted, it's still in alpha, but it sounds like it is both more accurate and faster.
TODO: set as default configuration
The precision and recall of the new Tesseract have improved significantly according to the official Tesseract wiki. In a few of our own tests, run locally on MTSamples, we also noticed significant improvements. The openjdk v.11 base image (based on Debian) already provides the new version of Tesseract in its repository. Hence, with PR #65 we made the new version of Tesseract the default in the upcoming version of the CogStack Pipeline image. The same applies to the TravisCI builds. I'm closing this issue; however, creating a proper benchmark suite to gather precision/recall/performance metrics would be a good idea, so that we have direct metrics for future improvements.
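As a starting point for such a benchmark, here is a minimal sketch of a word-level precision/recall check between OCR output and a reference transcript. It is only an illustration: the function name `word_precision_recall` is hypothetical, it is not part of CogStack Pipeline or Tesseract, and it treats each text as a bag of lowercased, whitespace-separated words, so it ignores word order and layout.

```python
from collections import Counter
from typing import Tuple


def word_precision_recall(ocr_text: str, reference_text: str) -> Tuple[float, float]:
    """Bag-of-words precision and recall of OCR output against a reference.

    Hypothetical helper for illustration only; word order is ignored.
    """
    ocr_counts = Counter(ocr_text.lower().split())
    ref_counts = Counter(reference_text.lower().split())

    # Multiset intersection: each word counts as matched at most as many
    # times as it occurs in both texts.
    matched = sum((ocr_counts & ref_counts).values())
    ocr_total = sum(ocr_counts.values())
    ref_total = sum(ref_counts.values())

    # Guard against empty inputs to avoid division by zero.
    precision = matched / ocr_total if ocr_total else 0.0
    recall = matched / ref_total if ref_total else 0.0
    return precision, recall


if __name__ == "__main__":
    # The OCR output is missing two of the four reference words.
    print(word_precision_recall("the cat", "the cat sat down"))  # (1.0, 0.5)
```

Running the same reference documents through both Tesseract versions and comparing these two numbers would give a rough, repeatable measure; a character-level edit distance would likely be a better fit for small errors such as two -> twn.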
Need to set up some basic benchmarks to test this. Keen to know if it helps with rotations.
https://github.com/tesseract-ocr/tesseract/wiki/4.0-with-LSTM
tesseract-ocr/tesseract#40
tesseract-ocr/tesseract#465