-
Notifications
You must be signed in to change notification settings - Fork 472
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Poor audio quality after fine-tuning #49
Comments
For 4, did you change |
I fine-tuned the model with
Do you know what is the minimal value for decent results? Unfortunately, I cannot use 400, but maybe I could set it a bit higher than 100 if I reduce batch_size even more. Training speed is not a concern for me.
Yes, that sounds much better. Could you please share inference parameters? Would be awesome if you still have alpha/beta values and the name of the reference clip, so I can compare my results using the same values. Thanks! |
Yes, you can leave I haven't really tested with different |
I'm trying to fine-tune the LibriTTS checkpoint on ~1 hour of LJSpeech but get poor results. Could you please give me some directions or help to spot the issue?
How I fine-tuned:
Data/train_list.txt
with a copy that only has the first 1000 lines (~1 hour for training)Inference_LibriTTS.ipynb
andInference_LJSpeech.ipynb
notebooks by changing themultispeaker
parameter in the config to true/false.Inference_LJSpeech.ipynb
produces very noisy results with a poor pronunciation.Inference_LibriTTS.ipynb
with reference audio from LJSpeech has a good pronunciation, but there are noticeable noises (example - https://voca.ro/1nQ8Ltjhsh9y)Thank you again for the awesome project!
The text was updated successfully, but these errors were encountered: