The goal of the project is to help you practice listening comprehension.
It works by giving you random sentences in the language you're learning and asking you to fill in the gaps. The sentences were submitted by contributors to the Mozilla Common Voice platform.
The sentences are ranked according to difficulty, and you can choose a level from 1 to 10 where 1 is the easiest and 10 is the hardest.
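As an illustration of how a ranking could be turned into ten levels, here is a minimal sketch. It does not show how the project measures difficulty. The function name `assign_levels` and the use of sentence length as the score are assumptions made only for this example.

```python
def assign_levels(sentences, score=len, levels=10):
    """Map each sentence to a level in 1..levels based on its rank.

    `score` is a stand-in difficulty measure (character count by default);
    it is a hypothetical placeholder, not the project's actual metric.
    """
    ranked = sorted(sentences, key=score)
    n = len(ranked)
    # Rank i (0-based) falls into bucket floor(i * levels / n), shifted to start at 1.
    return {s: (i * levels) // n + 1 for i, s in enumerate(ranked)}


if __name__ == "__main__":
    demo = ["Hei.", "Hyvää huomenta.", "Missä on lähin rautatieasema?"]
    for sentence, level in assign_levels(demo).items():
        print(level, sentence)
```

Because the result is a dictionary keyed by sentence text, duplicate sentences collapse into one entry.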
The project aims to require no knowledge of a meta language to start learning. If you are interested in a more traditional course-creation project, check out LibreLingo.
- Fill in the blanks: A cloze-style task
- Pick the right one: Get two options and choose the right one
- Drag and drop: Get a set of tiles and drag them to build a sentence
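The fill-in-the-blanks task can be sketched roughly as follows. This is a minimal illustration, not the project's implementation: `make_cloze` and `check` are hypothetical names, and splitting on whitespace is an assumption that will not work for languages written without spaces between words.

```python
import random


def make_cloze(sentence, rng=random, blank="____"):
    """Hide one randomly chosen word and return (prompt, answer)."""
    words = sentence.split()  # assumption: words are separated by whitespace
    if not words:
        raise ValueError("sentence has no words")
    i = rng.randrange(len(words))
    answer = words[i]
    words[i] = blank
    return " ".join(words), answer


def check(answer, guess):
    """Compare the guess to the hidden word, ignoring case and outer whitespace."""
    return guess.strip().casefold() == answer.strip().casefold()


if __name__ == "__main__":
    prompt, answer = make_cloze("Hyvää huomenta kaikille")
    print(prompt)
    print(check(answer, answer.upper()))
```

Any punctuation attached to the hidden word stays part of the expected answer here. A real checker would probably want to be more forgiving.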
- Tab: Play the recording
- Enter: Submit and check if you got it right
- Space: Next sentence
The data comes from the Common Voice dataset releases.
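For orientation, Common Voice releases describe their clips in tab-separated files with, among others, a `path` column (the audio file name) and a `sentence` column (the transcript). The sketch below reads those two columns. The helper name `read_clips` and the sample row are invented for illustration, and whether the project reads the files this way is an assumption.

```python
import csv
import io


def read_clips(tsv_file):
    """Yield (clip_path, sentence) pairs from an open Common Voice TSV file."""
    # QUOTE_NONE keeps quotation marks inside sentences as literal text.
    reader = csv.DictReader(tsv_file, delimiter="\t", quoting=csv.QUOTE_NONE)
    for row in reader:
        yield row["path"], row["sentence"]


# Made-up sample data standing in for a real TSV file from a release.
sample = io.StringIO(
    "client_id\tpath\tsentence\n"
    "abc\tcommon_voice_fi_1.mp3\tHyvää huomenta.\n"
)
print(list(read_clips(sample)))
# → [('common_voice_fi_1.mp3', 'Hyvää huomenta.')]
```

With a real release you would pass a file opened with `encoding="utf-8"` and `newline=""` instead of the `io.StringIO` sample.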
This system is designed with two main user groups in mind:
- People who want to learn a new language
- People who want to learn how to write their native language
The system endeavours to be audio first, with knowledge of writing built up by hearing.
- IRC: irc.freenode.net #OmniLingo
- Matrix: #OmniLingo:matrix.org (access via Element)
- Telegram: OmniLingo
All of the languages available in the Common Voice 6.1 dataset:
Abkhaz · Arabic · Assamese · Breton · Catalan · Hakha Chin · Czech · Chuvash · Welsh · German · Dhivehi · Greek · English · Esperanto · Spanish · Estonian · Basque · Persian · Finnish · French · Frisian · Irish · Hindi · Upper Sorbian · Hungarian · Interlingua · Indonesian · Italian · Japanese · Georgian · Kabyle · Kyrgyz · Luganda · Lithuanian · Latvian · Mongolian · Maltese · Dutch · Odia · Punjabi · Polish · Portuguese · Romansh Sursilvan · Romansh Vallader · Romanian · Russian · Kinyarwanda · Sakha · Slovenian · Swedish · Tamil · Thai · Turkish · Tatar · Ukrainian · Vietnamese · Votic · Chinese (China) · Chinese (Hong Kong) · Chinese (Taiwan)
If you want to work with a language that is not yet in Common Voice, we highly recommend getting it set up there. In the meantime, you can check out the format guidelines.
To bootstrap the project for Finnish, git clone the repository, then run the following commands:
cd templates && wget --no-check-certificate http://cl.indiana.edu/\~ftyers/cv/cv-corpus-6.1-2020-12-11.tar.gz -O- | tar zxf - && cd ..
pip install -r requirements.txt
./main.py
To add more languages, download a dataset from Common Voice and put it in templates/cv-corpus-6.1-2020-12-11.
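To check which languages are in place after unpacking, something like the following sketch may help. It assumes each Common Voice download unpacks to a per-locale subdirectory (for example `fi/`) containing a `validated.tsv` file. The helper name `installed_locales` is hypothetical, and the project itself may discover languages differently.

```python
from pathlib import Path


def installed_locales(root):
    """Return the sorted names of subdirectories of `root` that hold a validated.tsv."""
    root = Path(root)
    if not root.is_dir():
        return []
    return sorted(
        p.name for p in root.iterdir() if (p / "validated.tsv").is_file()
    )


if __name__ == "__main__":
    print(installed_locales("templates/cv-corpus-6.1-2020-12-11"))
```

An empty list means either that the directory does not exist yet or that no locale folder with a `validated.tsv` was found in it.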
Once ./main.py is running, the project should be accessible at http://localhost:5001/
Happy hacking! :)