-
Notifications
You must be signed in to change notification settings - Fork 72
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
the output of LSTM #13
Comments
why should we add noise to the lexicon(noises phonemes to the units) ? |
if they truly don't exist in your training data, you can safely delete them |
Thanks a lot. |
using tri-phone as the model unit in essen is possible, u might further generate context label (fstcomposecontext) as in HMM system, and replace tri-phone label in T.fst. |
sorry ,i don't got it. you mean that I should generate the tri-phone by the hybird pipeline or by the commmand ----"fstcomposecontext".the first or the second ?@chenzhehuai |
clustered tri-phone should be generated from hybrid system through clustering; while context in WFST can be generated by fstcomposecontext with extra mapping from tri-phone to clustered tri-phone |
An even simpler way is to generate forced alignment with the GMM-HMM, and take the CD states as CI CTC labels. With this, there is no need to consider context dependency in decoding. |
hello,in the stage of decoding, the problem occur as follows:
|
First, thanks for your help all the time. And I have been being confused by the modeled units all the time .For instance : The unit.txt
And I wonder why we should model the first phone and the second phone ,Actually,both of them don't exist in my training label.Can I delete them and not model them ?
any help would be appreciated.
The text was updated successfully, but these errors were encountered: