Don't drop words with low certainty #1264

amitdo · 2018-01-10T16:29:45Z

Fix #681.

amitdo · 2018-01-23T12:50:49Z

@theraysmith, @jbreiden,
Please review and approve this PR. I want this in to be in Ubuntu 18.04.

zc813 · 2018-02-20T14:14:36Z

Hi amitdo, I encountered the same problem. Is there anything I can do to help with this pull request?

amitdo · 2018-02-20T15:51:17Z

I think it's a good idea to test it on several languages and variety of pages.

Apart from this, someone with the right permissions will need to merge it...

zc813 · 2018-02-21T09:15:39Z

@amitdo Some of the results are better on Tibetan. Previously missing words are recognized after this commit.
However, there are still several entire lines of text missing. Different lines are skipped if a different psm (3, 6, or 11) is used. It's like #538 , but in my case, all texts are in the same size.

Any idea? Thanks a lot!

Don't drop words with low certainty

cd9797b

Fix #681.

amitdo mentioned this pull request Jan 25, 2018

Handle null raw_choice - fixes #235 #246

Closed

zdenop merged commit 766b7bd into tesseract-ocr:master Feb 20, 2018

amitdo deleted the dontdropwords branch February 20, 2018 16:23

zc813 mentioned this pull request Feb 21, 2018

Entire lines of text missing. Different missing when psm = 3, 6, 11 #1339

Open

amitdo mentioned this pull request Oct 15, 2018

Best Traineddata Feedback - Gujarati - ન - ત Confusion tesseract-ocr/tessdata#60

Open

amitdo mentioned this pull request Mar 19, 2021

Latin.traineddata(best) - Words missing in OCR #1080

Closed

amitdo added the feature request label Mar 21, 2021

peterlawkp mentioned this pull request Nov 19, 2022

Terminate because of 'std::bad_alloc' #3966

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Don't drop words with low certainty #1264

Don't drop words with low certainty #1264

amitdo commented Jan 10, 2018

amitdo commented Jan 23, 2018

zc813 commented Feb 20, 2018

amitdo commented Feb 20, 2018

zc813 commented Feb 21, 2018 •

edited

Loading

Don't drop words with low certainty #1264

Don't drop words with low certainty #1264

Conversation

amitdo commented Jan 10, 2018

amitdo commented Jan 23, 2018

zc813 commented Feb 20, 2018

amitdo commented Feb 20, 2018

zc813 commented Feb 21, 2018 • edited Loading

zc813 commented Feb 21, 2018 •

edited

Loading