-
Notifications
You must be signed in to change notification settings - Fork 140
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
kraken 5.2.4 on eScriptorium recognition artefacts #605
Comments
looks like shapely |
the polygon is too big and the recognizer wasn't trained on lines where the letters are only a quarter of the line height. |
The models I used in this test: |
It isn't a model issue but the polygonization is wrong. I'll have a look. The rotation code changed between 4.x and 5.x so it's either that or other shapely shenanigans. |
Could you also send me the image file and any ALTO/PageXML you've got? It's difficult to debug without being able to run a test case. |
export_doc23_memar_marqah_mcdonald_alto_202405131147.zip |
Thanks. It's mostly so I can make sure the baselines are identical. |
Any update on this matter? |
Apparently, the error persists on some other image data. |
Nope, not true after all. Just crappy output of the polygonizer. |
I'm not sure if is eScriptorium or kraken related, I just want to poin out, same model, same image on different installs:
Both segmentation and recognition models were trained on kraken 5.2.4
The text was updated successfully, but these errors were encountered: