Important:
Running this directly requires a lot of configuration. The recommended way is to use the UI application, https://github.com/Sharrnah/whispering-ui, which downloads this automatically.
Standalone Release File (3.1 GB):
Download Server:
Changelog (v1.3.14.8)
- [FEATURE] Add F5 TTS
- [FEATURE] Add option to translate to more than one target language
- [FEATURE] Add OSC Server to synchronize with VRChat Mute state
- [FEATURE] Add support to load a user custom model
- [TASK] Add reload voices event
- [TASK] Update dependencies
- [TASK] Initialize TTS after UI connected
- [TASK] Only send source + translation if both actually exist
- [TASK] Remove direct-ml for Linux
- [TASK] Add large-v3-turbo model for faster-whisper
- [TASK] Open playback audio device directly with the detected information instead of trying multiple options
- [TASK] Return audio segments in faster-whisper
- [TASK] Additional translation improvements
- [TASK] Update ctranslate library
- [BUGFIX] Use defined exclude_client for BroadcastMessage
- [BUGFIX] Add possible stream playback fix
- [BUGFIX] Add portaudio dependency to Linux build
- [BUGFIX] Return correct download status on fallback download
- [BUGFIX] Error if invalid F5/E5 model is requested
Full Changelog: v1.3.14.6...v1.3.14.8