TaSTT, a free self-hosted STT for VRChat #157
yum-food
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
TaSTT is a free and open source speech-to-text for VRChat. It uses OpenAI's whisper to transcribe speech to text, and streams it into the game via OSC.
Notably, all language services are performed on the client. This allows the service to be free, reliable, and very fast. End-to-end latency (with reasonable settings, hardware, and using the custom chatbox) is typically under 0.5 seconds.
I'm using faster-whisper, a compute-optimized implementation of Whisper. Most VR users are able to handle using this library with the base.en model.
It has a bunch of features:
I've been working on this as a passion project for almost a year. It started as a "weekend project" and grew into the largest software project I've ever made. There are about 170 users now.
Going forward, there are lots of things on the agenda:
... so the point is, this is an active project with a future.
Links:
Cheers!
yum
PS: Please add an OSC endpoint to explicitly dismiss the built-in chatbox, it would help improve the UX for mute players who use speech-to-text tools :)
Beta Was this translation helpful? Give feedback.
All reactions