-
Notifications
You must be signed in to change notification settings - Fork 27.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[docs] Updates to TTS task guide with regards to the new TTS pipeline #26095
[docs] Updates to TTS task guide with regards to the new TTS pipeline #26095
Conversation
The documentation is not available anymore as the PR was closed or merged. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM ! Thanks for updating it
I've left some really small suggestions, feel free to ignore them!
Co-authored-by: Yoach Lacombe <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks great - thanks for adding this!
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thank you for adding this @MKhalusova - I was going to add the same and saw that you've already done it. Just a wee suggestion to add info about VITS
as well.
Thank you!
Co-authored-by: Vaibhav Srivastav <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Very cool - thanks for this update @MKhalusova!
To be able to use your checkpoint with a pipeline, make sure to save the processor with the checkpoint: | ||
|
||
```py | ||
>>> processor.save_pretrained("YOUR_ACCOUNT_NAME/speecht5_finetuned_voxpopuli_nl") |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I believe if we omit the YOUR_ACCOUNT_NAME
prefix, the processor will just be saved under the users' namespace automatically?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I haven't tested, maybe. I think this way, it's rather consistent and definitely works :)
languages and for multiple speakers. Several text-to-speech models are currently available in 🤗 Transformers, such as | ||
[Bark](../model_doc/bark), [MMS](../model_doc/mms), [VITS](../model_doc/vits) and [SpeechT5](../model_doc/speecht5). | ||
|
||
You can easily generate audio using the `"text-to-audio"` pipeline (or its alias - `"text-to-speech"`). Some models, like Bark, |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nice!
…huggingface#26095) * tts guide updates with a pipeline * Apply suggestions from code review Co-authored-by: Yoach Lacombe <[email protected]> * Update docs/source/en/tasks/text-to-speech.md Co-authored-by: Vaibhav Srivastav <[email protected]> --------- Co-authored-by: Yoach Lacombe <[email protected]> Co-authored-by: Vaibhav Srivastav <[email protected]>
…huggingface#26095) * tts guide updates with a pipeline * Apply suggestions from code review Co-authored-by: Yoach Lacombe <[email protected]> * Update docs/source/en/tasks/text-to-speech.md Co-authored-by: Vaibhav Srivastav <[email protected]> --------- Co-authored-by: Yoach Lacombe <[email protected]> Co-authored-by: Vaibhav Srivastav <[email protected]>
…huggingface#26095) * tts guide updates with a pipeline * Apply suggestions from code review Co-authored-by: Yoach Lacombe <[email protected]> * Update docs/source/en/tasks/text-to-speech.md Co-authored-by: Vaibhav Srivastav <[email protected]> --------- Co-authored-by: Yoach Lacombe <[email protected]> Co-authored-by: Vaibhav Srivastav <[email protected]>
This PR adds: