You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The tesseract command line tool is able to generate only one lstmf file at a time.
When using single line images for training this means to generate 20/50k lstmf files or more and this can take hours.
Expected Behavior:
It should be possible to generate lstmf files for a at least a few thousands files without incurring in the overhead of starting a new tesseract process for each file.
I would expect the execution time to be reduced to a few minutes if not seconds.
Suggested Fix:
The executable should accept a list of files (or a folder and process all the tif files found there).
The text was updated successfully, but these errors were encountered:
I wonder whether lstmf files are useful at all. The training process could create the necessary data in memory on the fly instead of reading lstmf files.
Environment
Current Behavior:
The tesseract command line tool is able to generate only one lstmf file at a time.
When using single line images for training this means to generate 20/50k lstmf files or more and this can take hours.
Expected Behavior:
It should be possible to generate lstmf files for a at least a few thousands files without incurring in the overhead of starting a new tesseract process for each file.
I would expect the execution time to be reduced to a few minutes if not seconds.
Suggested Fix:
The executable should accept a list of files (or a folder and process all the tif files found there).
The text was updated successfully, but these errors were encountered: