A user added documents containing Norwegian text, and Datashare correctly identified the language as Norwegian.
However, the OCR does not seem to recognize Æ, Ø and Å.
I read tesseract-ocr/langdata#36; it sounds like the issue was fixed, but I'm not sure.
Should we check the latest version of Tesseract?
The container is rebuilt at each release with the latest Alpine tesseract-ocr package, so it already seems to be on the latest version.
/home/datashare # apk info tesseract-ocr
tesseract-ocr-4.0.0-r0 description:
open source OCR engine
tesseract-ocr-4.0.0-r0 webpage:
https://github.com/tesseract-ocr/tesseract/releases
tesseract-ocr-4.0.0-r0 installed size:
40067072
However, another issue mentions that Ø and Å are still missing: tesseract-ocr/langdata#91
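To check whether a given build is affected, one rough option is to OCR a sample page known to contain all the Norwegian letters and see which ones never show up in the output. The snippet below is only a sketch under that assumption: `missing_letters` and `NORWEGIAN_LETTERS` are hypothetical names, not part of Datashare or Tesseract, and the OCR text is assumed to have been produced separately.

```python
import unicodedata

# Letters the report says are affected, in both cases (assumption: this is
# the full set worth checking for Norwegian).
NORWEGIAN_LETTERS = "ÆØÅæøå"


def missing_letters(ocr_text: str, expected: str = NORWEGIAN_LETTERS) -> list:
    """Return the expected letters that never occur in ocr_text."""
    # Normalize to NFC so that "A" + combining ring counts as "Å".
    normalized = unicodedata.normalize("NFC", ocr_text)
    return [ch for ch in expected if ch not in normalized]


if __name__ == "__main__":
    # Hypothetical OCR output where the special letters were replaced.
    print(missing_letters("BLABAERSYLTETOY"))
    # Hypothetical OCR output where most letters survived.
    print(missing_letters("Ære være Øystein på Ås"))
```

An empty list only means the letters appeared at least once in that sample; it does not prove they are recognized reliably.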
Waiting for tesseract-ocr/langdata#91 to be fixed in Tesseract.
bamthomas