AI Voice Support? #165

Open
gamer805 opened this issue Feb 20, 2025 · 1 comment

Comments

@gamer805

I am working on a project that involves animating a chatbot, and I want to use rhubarb for the lip-syncing. However, after testing with both the PocketSphinx and Phonetic recognizers, neither seems to register individual phonemes; both produce only a rest output. This is the output I have consistently gotten when testing on AI voices:

0.00 X

It should be said that in the same environment, rhubarb does recognize recordings of my own voice, so I assume the problem has to do with the structure of the AI-generated audio files themselves. I wonder if support could be added for tweaking sensitivity on the user's end, or if supporting AI voices would require an entirely new speech recognizer.

@DanielSWolf
Owner

That sounds strange. It wouldn't surprise me if the results with AI voices were worse than for regular recordings, but seeing no output at all is unexpected. Could you attach an example file so that I can reproduce the issue?
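
When preparing an example file, it may also help to compare the basic properties of a recording that works with one that does not. The following is a minimal sketch, assuming both files are uncompressed PCM WAV. The helper `describe_wav` and the file names in the usage note are hypothetical and are not part of rhubarb. It only reads the WAV header and, for 16-bit audio, the peak sample level; it does not diagnose the recognizer itself.

```python
import array
import sys
import wave


def describe_wav(path):
    """Return basic header fields and the peak level of a PCM WAV file."""
    # wave.open raises wave.Error for files that are not plain PCM WAV,
    # which is itself a useful hint about the file's encoding.
    with wave.open(path, "rb") as wav:
        channels = wav.getnchannels()
        sample_width = wav.getsampwidth()  # bytes per sample
        frame_rate = wav.getframerate()    # samples per second
        n_frames = wav.getnframes()
        frames = wav.readframes(n_frames)

    # Peak level is only computed for 16-bit PCM in this sketch.
    peak = None
    if sample_width == 2:
        samples = array.array("h")
        samples.frombytes(frames)
        if sys.byteorder == "big":
            samples.byteswap()  # WAV sample data is little-endian
        # Normalize so that 1.0 corresponds to full scale.
        peak = max((abs(s) for s in samples), default=0) / 32768.0

    return {
        "channels": channels,
        "sample_width_bytes": sample_width,
        "frame_rate_hz": frame_rate,
        "duration_s": n_frames / frame_rate if frame_rate else 0.0,
        "peak": peak,
    }
```

Calling `describe_wav("my_voice.wav")` and `describe_wav("ai_voice.wav")` (placeholder names) and comparing the two results would show whether the files differ in channel count, sample rate, bit depth, or overall level. Any such difference is only a lead to include in the report, not a confirmed cause.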
