v1.3.14.2
Important:
This requires a lot of configuration if run directly. Recommended way is to use UI Application: https://github.com/Sharrnah/whispering-ui which downloads this automatically.
Standalone Release File (3.1 GB):
Download Server:
Changelog (v1.3.14.2)
- [FEATURE] Support for audio with more than 2 channels.
- [FEATURE] Add MMS STT model
- [FEATURE] clipboard image OCR support
- [FEATURE] Add select_audio widget for Plugins
- [FEATURE] Add textfield widget type
- [FEATURE] Add Speaker diarization class (experimental)
- [FEATURE] Add noisereduce algorythm
- [TASK] Improve streamed audio playback
- [TASK] use romaji setting for translation requests
- [TASK] Update ignorelist
- [TASK] add playback hook, simplify buffer size setting
- [TASK] Separation of audio processing for recording
- [TASK] Add get languages plugin method
- [TASK] Update dependencies
- [TASK] remove downloaded zip renaming
- [TASK] Add multiple file hash check utility function
- [TASK] Send loading message over stdout instead of websocket
- [TASK] Add plugin name to plugin errors
- [TASK] Upgrade dependencies + VAD model to v5
- [BUGFIX] catch plugin exceptions to not break whole application
- [BUGFIX] Fix possible process management error if process could not be run
- [BUGFIX] error on modified value in websocket message
- [BUGFIX] streamed playback of dynamic chunk size
- [BUGFIX] tagged streamed playback
- [BUGFIX] buffer element size calculation.
- [BUGFIX] Wait for resampling until full chunk is ready for streamed playback
- [BUGFIX] resample_audio function on gpu tensors, reshaping audio data
- [BUGFIX] Faster whisper handling of non avialable precision model files
- [BUGFIX] plugin on_*_call calls not returning anything.
Full Changelog: v1.3.13.1...v1.3.14.2