Unused words.txt Generated in prepare.sh for GigaSpeech Recipe #1756

Open
yfyeung opened this issue Sep 25, 2024 · 0 comments
Labels
bug Something isn't working

yfyeung (Collaborator) commented Sep 25, 2024

It seems that the words.txt generated in the following section of the prepare.sh script is overwritten and ultimately unused:

https://github.com/k2-fsa/icefall/blob/master/egs/gigaspeech/ASR/prepare.sh#L219-L242

This happens because the words.txt file is later overwritten by another one generated from lexicon.txt in this step:

https://github.com/k2-fsa/icefall/blob/master/egs/gigaspeech/ASR/prepare.sh#L255

The logic that writes the new words.txt can be seen in:

https://github.com/k2-fsa/icefall/blob/master/egs/librispeech/ASR/local/prepare_lang.py#L377
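
To illustrate the overwrite, here is a minimal sketch and not the actual icefall code. The helper names (`words_from_lexicon`, `write_words_txt`, `demo`), the toy lexicon, and the exact set of special symbols are assumptions made for illustration. It shows how a words.txt written by an earlier step is lost once a later step regenerates the file from lexicon.txt.

```python
import tempfile
from pathlib import Path


def words_from_lexicon(lexicon_path):
    """Collect the unique words (first column) of a lexicon file, sorted."""
    words = set()
    with open(lexicon_path, encoding="utf-8") as f:
        for line in f:
            parts = line.split()
            if parts:  # skip blank lines
                words.add(parts[0])
    return sorted(words)


def write_words_txt(words, out_path):
    """Write a word symbol table, one "<symbol> <id>" pair per line.

    The special symbols added here are an assumption for this sketch.
    Opening with mode "w" truncates any existing file at out_path.
    """
    symbols = ["<eps>"] + list(words) + ["#0", "<s>", "</s>"]
    with open(out_path, "w", encoding="utf-8") as f:
        for idx, sym in enumerate(symbols):
            f.write(f"{sym} {idx}\n")


def demo():
    """Simulate the two steps and return the final content of words.txt."""
    with tempfile.TemporaryDirectory() as tmp:
        lang_dir = Path(tmp)
        words_txt = lang_dir / "words.txt"

        # Earlier step (hypothetical content): words.txt is written first.
        words_txt.write_text("<eps> 0\nEARLIER 1\n", encoding="utf-8")

        # Later step: words.txt is regenerated from lexicon.txt alone.
        lexicon = lang_dir / "lexicon.txt"
        lexicon.write_text("HELLO HH AH L OW\nWORLD W ER L D\n", encoding="utf-8")
        write_words_txt(words_from_lexicon(lexicon), words_txt)

        return words_txt.read_text(encoding="utf-8")


if __name__ == "__main__":
    print(demo())
```

In this sketch nothing from the first words.txt survives the second write, which matches the behavior described above: the file produced by the earlier section has no effect on the final result.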

@yfyeung yfyeung added bug Something isn't working invalid This doesn't seem right and removed bug Something isn't working labels Sep 25, 2024
@yfyeung yfyeung added bug Something isn't working and removed invalid This doesn't seem right labels Oct 9, 2024