AI Voice Support? #165

gamer805 · 2025-02-20T19:00:44Z

I am working on a project that involves animating a chatbot, and wish to use rhubarb to animate the lip-syncing. However, after testing with both the PocketSphinx and Phonetic recognizers, neither seems to register individual phonemes; both produce only a rest output. This is the output I have consistently gotten when testing on AI voices:

0.00 X

It should be said that in the same environment, rhubarb does work for recognizing recordings of my own voice, so I assume it has to do with the structure of the AI-generated audio files themselves. I wonder if support could be added for tweaking sensitivity on the user's end, or if supporting AI voices would require an entirely new speech recognizer.

DanielSWolf · 2025-02-26T14:21:00Z

That sounds strange. It wouldn't surprise me if the results with AI voices were worse than for regular recordings, but seeing no output at all is unexpected. Could you attach an example file, so that I can reproduce the issue?
