Feature request - Ukrainian model #30

egorsmkv · 2020-11-05T10:18:53Z

🚀 Feature

We would like to have a Ukrainian model for the task of Speech-to-Text.

Motivation

Ukraine has a large population and in the country and there are tons of tasks related to Speech-to-Text.

Additional context

Our group that is based in Telegram ( https://t.me/speech_recognition_uk ) collected a dataset of Ukrainian public speeches/interviews in audio and text formats accessed here: https://mega.nz/folder/T34DQSCL#Q1O8vcrX_8Qnp27Ge56_4A/folder/O3hzlKIJ

We think this dataset will be helpful in the training process.

egorsmkv · 2020-11-05T11:58:34Z

Also we have own repository where we’re collecting links to datasets: https://github.com/egorsmkv/speech-recognition-uk

snakers4 · 2020-11-05T12:02:24Z

Hi,

This is exactly the effort I would expect from the community for low-resource languages
For now - I will just fit a model on your data as is and share the model via silero-models
Then when V3 compact models arrive for all languages, I will consider tuning a Russian model on your corpus

https://github.com/egorsmkv/speech-recognition-uk

Just a few ideas on how to make your repo better:

Add some table with overall statistics
Add some commands (maybe some cli to download your files)
Direct links are always nice

egorsmkv · 2020-11-05T15:54:18Z

Thanks for your suggestions!

snakers4 · 2020-11-06T16:51:39Z

Please see
#20 (comment)

snakers4 · 2020-11-07T07:23:07Z

I updated the models and purged the CDN cache.

snakers4 · 2020-11-07T07:24:31Z

It is unlikely that much will change soon, so if everything works let's close the ticket.

egorsmkv added the enhancement New feature or request label Nov 5, 2020

egorsmkv assigned snakers4 Nov 5, 2020

snakers4 mentioned this issue Nov 6, 2020

Changelog #20

Open

egorsmkv closed this as completed Nov 7, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Feature request - Ukrainian model #30

Feature request - Ukrainian model #30

egorsmkv commented Nov 5, 2020 •

edited

Loading

egorsmkv commented Nov 5, 2020

snakers4 commented Nov 5, 2020

egorsmkv commented Nov 5, 2020

snakers4 commented Nov 6, 2020

snakers4 commented Nov 7, 2020

snakers4 commented Nov 7, 2020

Feature request - Ukrainian model #30

Feature request - Ukrainian model #30

Comments

egorsmkv commented Nov 5, 2020 • edited Loading

🚀 Feature

Motivation

Additional context

egorsmkv commented Nov 5, 2020

snakers4 commented Nov 5, 2020

egorsmkv commented Nov 5, 2020

snakers4 commented Nov 6, 2020

snakers4 commented Nov 7, 2020

snakers4 commented Nov 7, 2020

egorsmkv commented Nov 5, 2020 •

edited

Loading