Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Missed letter in the hye.traineddata #49

Open
reneclais opened this issue Sep 21, 2023 · 3 comments
Open

Missed letter in the hye.traineddata #49

reneclais opened this issue Sep 21, 2023 · 3 comments

Comments

@reneclais
Copy link

reneclais commented Sep 21, 2023

In the hye.traineddata the letter և is not included. This letter is replaced by the letter ն . Indeed the two letters aspect are very similar, but they have not the same signification. I have found that in the old arm.traineddata there is no such a problem.

@stefan6419846
Copy link

This is the wrong repository for reporting this in my opinion.

Nevertheless, there is no arm model in the official repositories, only an ara and an asm one. The general configuration is in the langdata and langdata_lstm repositories, the trained models are in the tessdata* repositories. As the models have been trained by Google most of time, there probably will not be any change to fix this character, but you might decide to train your own fixed model and maybe provide it to the public inside the tessdata_contrib repository.

@stweil
Copy link
Member

stweil commented Sep 21, 2023

Armenian.traineddata contains the missing character, so I suggest to try that model.

@stweil
Copy link
Member

stweil commented Sep 21, 2023

I'll transfer this issue from tesstrain to langdata_lstm where it fits better.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants