Build failure with leptonica 1.83 #87

risicle · 2023-01-28T21:16:55Z

Leptonica 1.83 moved a number of struct definitions into "private" headers, notably Pix and Box et al.

This causes a build failure:

src/TessTools.cpp:147:25: error: member access into incomplete type 'PIX' (aka 'Pix')
   l_uint32 *datas = pixs->data;
...

To address this, an extra import of <leptonica/pix_internal.h> needs to be added to src/TessTools.h.

On top of this, it looks like this version got rid of the library's lept alias, so references to -llept in qt-box-editor.pro need to be switched to -lleptonica.

The text was updated successfully, but these errors were encountered:

zdenop · 2023-01-29T20:16:21Z

qt-box-editor was IMO relevant for tesseract 3.x training (legacy engine) and it does not provide any value for the current tesseract version...
So what is the value if it is possible to build with the latest version of leptonica&tesseract?

risicle · 2023-01-30T19:27:22Z

Simply that older leptonica versions have security vulnerabilities meaning we (NixOS) can't ship them.

Perhaps this is an indication that we should just drop the qt-box-editor package, but as long as it's relatively straightforward to keep it building, we probably will do so with patches.

zdenop · 2023-01-31T07:30:41Z

It is not problem to include patch here, I just wander if really people are actively using this.

dpward · 2023-03-11T18:54:09Z

Yes. The current version of Tesseract still supports the OCR-based engine. The LSTM model takes significantly longer to train, according to the Tesseract documentation itself.

zdenop · 2023-03-11T20:56:23Z

LSTM engine does not need to be trained from scratch (legacy engine has to). E.g. you can train and extend only problems.
IMO LSTM training is (could be) faster as you do not need to take care about bounding boxes of letters and training based on tutorials like this seem to be pretty easy.

Anyway I made requested changes of QTB code.

dpward · 2023-03-11T21:02:14Z

Unfortunately LSTM doesn't seem to work well on matching basic monospace without word recognition.

zdenop · 2024-10-14T19:04:14Z

fixed.

zdenop closed this as completed Oct 14, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Build failure with leptonica 1.83 #87

Build failure with leptonica 1.83 #87

risicle commented Jan 28, 2023

zdenop commented Jan 29, 2023

risicle commented Jan 30, 2023

zdenop commented Jan 31, 2023

dpward commented Mar 11, 2023 •

edited

Loading

zdenop commented Mar 11, 2023

dpward commented Mar 11, 2023 •

edited

Loading

zdenop commented Oct 14, 2024

Build failure with leptonica 1.83 #87

Build failure with leptonica 1.83 #87

Comments

risicle commented Jan 28, 2023

zdenop commented Jan 29, 2023

risicle commented Jan 30, 2023

zdenop commented Jan 31, 2023

dpward commented Mar 11, 2023 • edited Loading

zdenop commented Mar 11, 2023

dpward commented Mar 11, 2023 • edited Loading

zdenop commented Oct 14, 2024

dpward commented Mar 11, 2023 •

edited

Loading

dpward commented Mar 11, 2023 •

edited

Loading