-
-
Notifications
You must be signed in to change notification settings - Fork 344
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Inference of Multiband MelGAN (v2) with ForwardTacotron #346
Comments
Not sure but the following points might be different:
If you use the same mel basis, maybe log is different. You can change the log basis to match the feature extraction condition. |
Is there a way to circumvent without re-training the text2mel model? |
we have tried to fix this by using np.log10(np.exp(ft_mel_output)) and normalize it using the mean var normalization with standardScaler |
If fmin / fmax are different, there is no way to use pretrained model. |
I have trained a ForwardTacotron text2mel model and I would like to integrate it into Parallelwavegan.
For now we have extracted the genrated mel_post (Melspectrogram after Postnet) from here and saving it as a .npy file (alifiya_esp_1.npy.zip). Then we use StandardScalar to normalize the data from here (1 and 2) and infer through here.
This is the sample output of the corresponding numpy file (alifiya_esp_1.wav.zip)
The same mel is working fine with GriffinLim.
Where am I going wrong?
The text was updated successfully, but these errors were encountered: