v1.3.14.2

Sharrnah released this 05 Jul 15:12

· 67 commits to main since this release

196d8bf

Important:

This requires a lot of configuration if run directly. Recommended way is to use UI Application: https://github.com/Sharrnah/whispering-ui which downloads this automatically.

Standalone Release File (3.1 GB):

Download Server:

Changelog (v1.3.14.2)

[FEATURE] Support for audio with more than 2 channels.
[FEATURE] Add MMS STT model
[FEATURE] clipboard image OCR support
[FEATURE] Add select_audio widget for Plugins
[FEATURE] Add textfield widget type
[FEATURE] Add Speaker diarization class (experimental)
[FEATURE] Add noisereduce algorythm
[TASK] Improve streamed audio playback
[TASK] use romaji setting for translation requests
[TASK] Update ignorelist
[TASK] add playback hook, simplify buffer size setting
[TASK] Separation of audio processing for recording
[TASK] Add get languages plugin method
[TASK] Update dependencies
[TASK] remove downloaded zip renaming
[TASK] Add multiple file hash check utility function
[TASK] Send loading message over stdout instead of websocket
[TASK] Add plugin name to plugin errors
[TASK] Upgrade dependencies + VAD model to v5
[BUGFIX] catch plugin exceptions to not break whole application
[BUGFIX] Fix possible process management error if process could not be run
[BUGFIX] error on modified value in websocket message
[BUGFIX] streamed playback of dynamic chunk size
[BUGFIX] tagged streamed playback
[BUGFIX] buffer element size calculation.
[BUGFIX] Wait for resampling until full chunk is ready for streamed playback
[BUGFIX] resample_audio function on gpu tensors, reshaping audio data
[BUGFIX] Faster whisper handling of non avialable precision model files
[BUGFIX] plugin on_*_call calls not returning anything.

Full Changelog: v1.3.13.1...v1.3.14.2

Assets 2