About the "languages" folder and files

Most of these files are from the original software from Nakatani Shuyo. Unfortunately, the data sources from which they were generated are not available. It looks like the text comes from Wikipedia pages.

To generate your own language profile, see the main readme at https://github.com/optimaize/language-detector

km Khmer: sources available, see #19

About the "languages.shorttext" folder and files

These files are from the original software from Nakatani Shuyo.

Either they are for detecting language on short messages, or they are built from short message text, or both, I don't know.

About the "messages.properties" file

They are used in the CharNormalizer.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

About the "languages" folder and files

About the "languages.shorttext" folder and files

About the "messages.properties" file

Files

README.md

Latest commit

History

README.md

File metadata and controls

About the "languages" folder and files

About the "languages.shorttext" folder and files

About the "messages.properties" file