Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ASR training using ESPnet2 library calls #16

Open
Tirthankar-iiitb opened this issue Apr 14, 2021 · 2 comments
Open

ASR training using ESPnet2 library calls #16

Tirthankar-iiitb opened this issue Apr 14, 2021 · 2 comments

Comments

@Tirthankar-iiitb
Copy link

Hi - I am looking for an example notebook where I want to train an ASR on a dataset such as TIMIT using ESPnet2 library calls. The data preparation is required to be done separately in Python (not using recipes) for 'sound' or 'npy' (not Kaldi style) as would be required. Any pointer on the training part would be helpful./Tirthankar.

Here is my experiment but this is giving error on epoch 1 training.
timit_train_espnet2.md

@sw005320
Copy link
Contributor

Thanks.
We actually do not prepare an espnet2 ASR example, so it would be great if you make it work and report it.

According to the log file you attached, it seems that the input data has some issues.
Your current setup of espnet2 assumes the time domain waveform (16k sampling) instead of speech features.
Could you check it?
Also, you would need to compute the mean and variance statistics.

@Tirthankar-iiitb
Copy link
Author

Tirthankar-iiitb commented Apr 15, 2021 via email

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants