-
Notifications
You must be signed in to change notification settings - Fork 782
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
VoiceCraft Fine-tune dataset preparation #138
Comments
I would suggest you go through
since phonemes are treated as text tokens, you need to load a map that map phoneme character to index for the model to generate phoneme embedding, an example phoneme map is stored at any of the pretrained weight: You don't need to follow the exact file structure the dataset required, as long as your
where
Once you defined your dataset that output the above, you can pretty much run the training as it is by changing the dataset from The phonemes and encodec generation are both in
So if you look at any of the pretrained model
So what you need to do is:
At last, at your training script, indicate:
hope this helps |
Well thanks for taking your time to explain this. I will try again sometime in the future and see how it goes! |
Hello
I've been going through the the instructions on gitpage however I was not able to figure out how one prepares their own dataset for finetuning.
Could anyone share this if they have been successful.
The text was updated successfully, but these errors were encountered: