Feel free to post any issues or questions about Cognitive TTS service here! #128
Comments
Welcome post like
|
It seems the output is limited to 10 minutes of audio (at least with the neural option). What if I want to process a long text file, such as a required reading or a chapter of a book? |
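One possible workaround, not confirmed by the maintainers in this thread, is to split the text into smaller pieces, synthesize each piece separately, and join the audio afterwards. Below is a minimal Python sketch of the splitting step only. The function name `split_into_chunks` and the 3000-character default are assumptions chosen for illustration, not documented service limits.

```python
import re


def split_into_chunks(text: str, max_chars: int = 3000) -> list[str]:
    """Split text into chunks of at most max_chars characters.

    Chunks break at sentence ends where possible. max_chars is an
    illustrative value, not a documented limit of the TTS service.
    """
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        # A single sentence longer than the limit is cut at the limit.
        while len(sentence) > max_chars:
            if current:
                chunks.append(current)
                current = ""
            chunks.append(sentence[:max_chars])
            sentence = sentence[max_chars:]
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            current = candidate
        else:
            # The sentence does not fit: close the current chunk and start a new one.
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each chunk could then be sent as its own synthesis request. How the resulting audio segments are concatenated depends on the output format you request.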
I'm using the voice pt-PT-HeliaRUS with language pt_PT for a chatbot in a client project. Given "O seu e-mail é <say-as interpret-as="characters">[email protected]", the email is not spelled out. I tried other voice and language pairs, such as pt-BR-HeloisaRUS with pt_BR and en-US-JessaRUS with en-US. Only the voice "en-US-Jessa24kRUS" spells the name. Can you tell me why? As a workaround, we can force spelling by separating the characters of the email text with spaces. Is this a problem with the pt-PT language? And how can I get better results when spelling emails? |
How do I control the pace of the generated speech? I need to slow it down by 10%. |
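SSML has a `prosody` element whose `rate` attribute accepts a relative percentage, so wrapping the text in `<prosody rate="-10%">` is one way to ask for speech that is 10% slower. Below is a minimal Python sketch that builds such a document. The helper name `build_ssml` is hypothetical, and the default voice is simply one of the voices mentioned earlier in this thread.

```python
from xml.sax.saxutils import escape


def build_ssml(text: str, voice: str = "en-US-JessaRUS",
               lang: str = "en-US", rate: str = "-10%") -> str:
    """Return an SSML document that speaks `text` at a relative rate.

    rate="-10%" requests speech 10% slower than the voice default.
    The voice and language defaults are illustrative choices.
    """
    return (
        f'<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
        f'xml:lang="{lang}">'
        f'<voice name="{voice}">'
        # escape() protects characters such as & and < inside the spoken text.
        f'<prosody rate="{rate}">{escape(text)}</prosody>'
        "</voice></speak>"
    )
```

The returned string would be sent as the body of the synthesis request in place of plain text.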
Hello, I have tried both of the below, but I get "InitialSilenceTimeout" for every recording I try. As a test, I ran the same recording through at 16 kHz and got a "RecognitionStatus" of "Success". I then resampled it to 44.1 kHz and got "InitialSilenceTimeout". I have a working example that uses the Speech SDK, but now I need to use 44.1 kHz audio data with the REST API. Any advice would be greatly appreciated. Thank you! |
Text to Speech does not set the right pitch if two pitches are set in one request. Sample SSML:
I have a sentence in which two parts are set to pitch=28%. Note that this happens with all voices and looks like a major bug. Please test this in the US East region. |
I'm using the REST API to synthesize text to speech, but I'd like to know how to play the response. Any ideas on how to convert and play the response? My request:
|
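On playing the response: when a RIFF/WAV output format is requested, the response body is already a WAV file, so one simple approach is to write the bytes to disk unchanged and open the file in any audio player. Below is a minimal Python sketch under that assumption. Both function names are hypothetical, and `make_silent_wav` only stands in for a real response body so the sketch runs without a network call.

```python
import io
import wave
from pathlib import Path


def save_wav_response(audio_bytes: bytes, path: str) -> float:
    """Write a WAV response body to `path` and return its duration in seconds.

    Assumes a RIFF/WAV output format was requested. The duration is read
    from the WAV header and may be unreliable if that header carries a
    placeholder length.
    """
    if audio_bytes[:4] != b"RIFF":
        raise ValueError("Body is not RIFF/WAV audio; check the requested output format")
    Path(path).write_bytes(audio_bytes)
    with wave.open(io.BytesIO(audio_bytes), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def make_silent_wav(seconds: float = 1.0, rate: int = 16000) -> bytes:
    """Stand-in for a real response body: mono 16-bit PCM silence."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * int(seconds * rate))
    return buf.getvalue()
```

With a real request, you would pass the raw response bytes to `save_wav_response` instead of the output of `make_silent_wav`.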
This pertains to commit d457a6d. Is it possible to share the Python / Node version of the code? Thanks! |
We encourage developers to post issues and questions in this forum.
It is monitored regularly.