Test LSTM OCR Engine in Tesseract #30

afolarin · 2017-07-08T09:34:36Z

Need to set up some basic benchmarks to test this. Keen to know if it helps with rotations.

https://github.com/tesseract-ocr/tesseract/wiki/4.0-with-LSTM
tesseract-ocr/tesseract#40
tesseract-ocr/tesseract#465

jstuczyn · 2017-07-10T14:18:24Z

I've just updated Tesseract on our dev environment to 4.0.0alpha and reran the problematic test (the one with a rather poor-quality document, where the tables are not aligned, etc.). The result is significantly better than with the older version.
For comparison:
Actual text:

(2) If only one person is ticket *(part of the letter "d" is not visible, so it in fact looks like a "t")* in the final two columns then they are the nearest relatives.

3.04.01:

{2} Heel}; 0ch persnn is tieket in the final twrr eeltrnms their Hwy are the nearest relatives.

4.0.0alpha:

(2) If only one person is ticket in the final twn columns then they are the nearest relatives,

However, that does not mean 4.0.0 only makes tiny mistakes like two -> twn. Still, there are significantly fewer mistakes, and the meaning of a sentence remains understandable despite them.
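To put a rough number on the comparison above, here is a minimal sketch of a character error rate (edit distance divided by reference length). It is only an illustration, not something that was run as part of this test: the function names are made up, and the reference string is the "actual text" above with the inline note removed.

```python
def levenshtein(a: str, b: str) -> int:
    """Edit distance between a and b (insert, delete, substitute each cost 1)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def char_error_rate(reference: str, hypothesis: str) -> float:
    """Edit distance normalised by the reference length (lower is better)."""
    if not reference:
        raise ValueError("reference text must not be empty")
    return levenshtein(reference, hypothesis) / len(reference)


# Sample strings copied from the comparison above (inline note removed).
reference = ("(2) If only one person is ticket in the final two columns "
             "then they are the nearest relatives.")
old_output = ("{2} Heel}; 0ch persnn is tieket in the final twrr eeltrnms "
              "their Hwy are the nearest relatives.")
new_output = ("(2) If only one person is ticket in the final twn columns "
              "then they are the nearest relatives,")

for label, text in (("3.04.01", old_output), ("4.0.0alpha", new_output)):
    print(f"{label}: CER = {char_error_rate(reference, text):.3f}")
```

A single sentence is far too small a sample to draw conclusions from, but the same function could be applied per document in a larger test set.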

afolarin · 2017-07-10T16:27:58Z

Yeah, I was keen to see how much better the LSTM OCR engine is. It's still in alpha, but it sounds like it is both more accurate and faster.

afolarin · 2017-09-27T16:02:48Z

TODO: set as default configuration

lrog · 2018-11-23T17:45:59Z

According to the official Tesseract wiki, the precision and recall of the new Tesseract have improved significantly. In a few of our own tests, run locally on MTSamples, we also noticed significant improvements.

The OpenJDK 11 base image (based on Debian) already provides the new version of Tesseract in its repository. Hence, with PR #65 we made the new version of Tesseract the default in the upcoming version of the CogStack Pipeline image. The same applies to the TravisCI builds.

I'm closing this issue. However, creating a proper benchmark suite to gather precision/recall/performance metrics would be a good idea, so that we have direct metrics for future improvements.
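As a starting point for such a suite, a word-level precision/recall could be sketched as below. This is an assumption about how the metrics might be defined (bag-of-words overlap between the OCR output and a ground-truth transcript), not necessarily the definition used on the Tesseract wiki, and the function name is hypothetical.

```python
from collections import Counter


def word_precision_recall(reference: str, hypothesis: str) -> tuple[float, float]:
    """Bag-of-words precision and recall of OCR output against ground truth.

    A word counts as matched once for every time it occurs in both texts;
    word order is ignored, which keeps the sketch simple but is lenient.
    """
    ref_counts = Counter(reference.split())
    hyp_counts = Counter(hypothesis.split())
    matched = sum((ref_counts & hyp_counts).values())  # multiset intersection
    hyp_total = sum(hyp_counts.values())
    ref_total = sum(ref_counts.values())
    precision = matched / hyp_total if hyp_total else 0.0
    recall = matched / ref_total if ref_total else 0.0
    return precision, recall


# Toy example: two of the three recognised words occur in the reference.
precision, recall = word_precision_recall("a b c d", "a b x")
print(f"precision={precision:.2f} recall={recall:.2f}")
```

Timing each OCR run alongside these scores would cover the performance side of the metrics.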

afolarin assigned afolarin and jstuczyn Jul 8, 2017

afolarin mentioned this issue Sep 27, 2017 in OCR Test failure #4 (Closed)

lrog closed this as completed Nov 23, 2018
