-
Notifications
You must be signed in to change notification settings - Fork 1
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to fix pronunciation issues #5
Comments
Well - this is not a bug in txt2pho... The programs in this repo serve very different purposes. txt2pho is responsible At first glance both problems seem to be quite easy to solve. As always with natural Take your example
gets translated to
which means that the ordinals are not spoken correctly. OK, let's fix
which is better but still incorrect since in German ordinals are declined in the sentence.
Yes, that is basically a similar problem. The preprocessor must be able to understand At the moment no-one has volunteered to write such an elaborated version of preproc... PS: If you have a text output coming from an another programs it's rather easy to
|
I'm aware of the complexity but I'm confused why |
OK, let's focus on txt2pho...
becomes
or
|
Ah sorry, there was a dot missing. Try this: |
Yes - can see the problem now. I'm wondering what the cause for this strange behaviour is. |
I thought it was a rule defined somewhere since it actually transforms |
Did you find out anything new about this? I've seen there were some recent commits related to dates 🙂 |
@GHPS I've integrated txt2pho in the latest SEPIA-Home release 🙂 . Here are instructions to install it. I really like the voices but from time to time I find some strange artifacts (that don't appear in espeak or default MBROLA). For example if you ask SEPIA for the date you will get the answer "Heute ist der 12.05.2022" but what you hear is really weird: "Heute ist der zwölft punkt null fünf null zwei null zwei zwei punkt zweitausendzweiundzwanzig" 😅 🙈 . My "speak" script looks like this (arguments: gender, voice, text): |
Great - SEPIA is a very promissing project. I'll link to the instructions in the readme of this project. Concerning the pronunciation issue: The log-files should give some insight what is going on/wrong. I'll take a deeper look into the code in the next week... |
Great, thanks! 🙂 |
Hi @GHPS I found another issue with the pronunciation, again related to "." after numbers 😢.
The "70" will not be spoken at all. It works when I remove the "." at the end. |
Thanks for the information.
That is in principle the same problem as discussed above: txt2pho converts a That is why the preprocessor is necessary. It uses a number of heuristics to In short: Use preproc to convert numbers or whole sentences before sending |
The preprocessor has unfortunately some weird behavior as well :-/ for example: I initially removed the preprocessor because I'm doing my own processing first, but it may still be the better option compared to loosing numbers completely 😅 |
I've been experimenting with txt2pho and MBROLA and noticed something odd:
The sentence
Heute ist der 19.12.2021
will abort after19.
(pronounced "neunzehnte") :-/.I'm using this command:
echo "Heute ist der 19.12.2021" | iconv -cs -f UTF-8 -t ISO-8859-1 | ./txt2pho -m | mbrola /usr/share/mbrola/de2/de2" - test.wav
I was wondering where one can edit the rule that is responsible for this behavior.
Besides that there is a light problem with numbers at the end of a sentence, because they will always be spoken as ordinals:
"Er wurde Heute 40.". Though without context it is indeed unclear if "Er wurde Heute vierzig" or "Er wurde Heute Vierzigster" is the right version ^^.
Btw I'm avoiding 'preproc' because it has it's own set of issues 😅 🙈
The text was updated successfully, but these errors were encountered: