Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

θ in Greek book font rendered as swash form #56

Open
nisbet-hubbard opened this issue Dec 23, 2023 · 2 comments
Open

θ in Greek book font rendered as swash form #56

nisbet-hubbard opened this issue Dec 23, 2023 · 2 comments
Labels
enhancement New feature or request

Comments

@nisbet-hubbard
Copy link

IMG_1991

OCR result: ϑεοὶ γὰρ οὔποτ᾽,

This is an ordinary book font used by editions of classical texts. Because the design of its theta, however, this letter is frequently OCR’ed as a swash form and requires manual correction as it stands out from the rest of the text when rendered in other (esp. sans) Greek fonts.

@stweil
Copy link
Member

stweil commented Dec 23, 2023

So improved training is necessary. Do you know a freely available computer font which emulates that design? Or is there a ground truth data set which can be used to train recognition of that font?

@stweil stweil added the enhancement New feature or request label Dec 23, 2023
@nisbet-hubbard
Copy link
Author

Yes, there is! There’s two fonts with this sort of theta and rho, both under the open font licence.

GFS Heraklit: the text in the image probably used the italic of this. Scroll down for the download.

GFS Artemisia: in a slightly different style.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

2 participants