-
Notifications
You must be signed in to change notification settings - Fork 1
'grctraining' repository from http://ancientgreekocr.org/. Rules and tools to deterministically generate all prerequisites for the final training process.
License
ryanfb/ancientgreekocr-grctraining
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
Source files for some automatically generated parts of the Ancient Greek (grc) training for Tesseract OCR. Specifically, this contains the Makefile and its prerequisites to build the following files needed for the grc training: - training_text.txt - grc.word.txt - grc.freq.txt - grc.unicharambigs - grc.wordlist # Dependencies The tool tlgu is required. Download and install it from: http://tlgu.carmen.gr/ On a Mac with homebrew, install coreutils and gnu-sed (needed for gsed, gmktemp, gshuf). # To build the training parts Note that the build starts by downloading and unpacking a text corpus from which to generate the wordlists. Make all of the parts with the command: make
About
'grctraining' repository from http://ancientgreekocr.org/. Rules and tools to deterministically generate all prerequisites for the final training process.
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published