Skip to content

Releases: Sharrnah/whispering

v1.3.11.1

26 Sep 08:34
Compare
Choose a tag to compare

Standalone Release File (2.6 GB):

Download Server:

Changelog (v1.3.11.1)

  • [FEATURE] Implement faster resampling method (old method is still available as audio_tools.resampy_audio()
  • [FEATURE] Add new faster-whisper settings (for example: repetition_penalty to give AI a penalty for repetition.)
  • [FEATURE] Add Seamless-M4T model as multi-modal model. (Speech to Text Translation + Text Translation for now)
  • [FEATURE] Implement auto building.
  • [BUGFIX] EasyOCR Bugfix
  • [BUGFIX] nltk punkt download on pyinstaller bundle for M2M100 model
  • [TASK] Update CTranslate2
  • [TASK] Disable Denoise AI by default (since depending on the audio, it might filter too much, like for music)
  • [TASK] Update libraries
  • [TASK] Update ignorelist

Full Changelog: v1.3.10.2...v1.3.11.1

v1.3.10.2

19 Aug 17:50
Compare
Choose a tag to compare

Standalone Release File (2.6 GB):

Download Server:

Changelog (v1.3.10.2)

  • [FEATURE] Manage audio playback in threads, allow stopping audio
  • [FEATURE] Provide way to handle playing audio via tags.
  • [FEATURE] Add osc minimum frequency and queue system
  • [FEATURE] Add support for custom ignorelist
  • [TASK] Add osc_type_transfer option "both_inverted"
  • [TASK] Add Silero fallback download server
  • [TASK] Add option to force non-ui downloads
  • [TASK] Add fairseq dependency
  • [BUGFIX] Fix possible encoding issue with audio device names. (hopefully)
  • [BUGFIX] Do not keep button values in plugin settings
  • [BUGFIX] downloader removing non-zip files.
  • [BUGFIX] Safely decode yaml settings
  • [BUGFIX] Add sentence splitting to M2M100 text translator

Full Changelog: v1.3.9.2...v1.3.10.2

v1.3.9.2

13 Aug 04:45
Compare
Choose a tag to compare

Standalone Release File (2.6 GB):

Download Server:

Changelog (v1.3.9.2)

  • [BUGFIX] correctly convert chinese audio markers to wav
  • [BUGFIX] calculate phrase_time_limit based on correct sample rate
  • [BUGFIX] Do not add marker prompt if try_count = -1
  • [BUGFIX] send valid languages struct if no STT
  • [TASK] debounce settings writes
  • [TASK] correctly save vad_confidence_threshold as float instead of string

Full Changelog: v1.3.9.1...v1.3.9.2

v1.3.9.1

09 Aug 21:40
Compare
Choose a tag to compare

Standalone Release File (2.6 GB):

Download Server:

Changelog (v1.3.9.1)

  • [HOTFIX] missing NLTK punkt dependency in standalone build
  • [FEATURE] Add EU finetune models
  • [FEATURE] Add silence part deletion for recordings
  • [FEATURE] Add normalization
  • [FEATURE] Allow disabling STT model
  • [FEATURE] Implement Voice Markers as mentioned in Sharrnah/whispering-ui#10
  • [BUGFIX] resample merged audio instead of each chunk seperately to prevent audio issues
  • [BUGFIX] correctly splitting OSC chunks with ...
  • [BUGFIX] break out of chunked sending early for new message
  • [BUGFIX] Split by sentences before word tokenization for NLLB200
  • [BUGFIX] fix a possible memory leak
  • [TASK] Only try downloading nltk if not already existing
  • [TASK] no byte conversion for enhance_audio function (to only convert if necessary)
  • [TASK] use async callback for recordings

Full Changelog: v1.3.8.3...v1.3.9.1

v1.3.8.3

31 Jul 01:45
Compare
Choose a tag to compare

Standalone Release File (2.6 GB):

Download Server:

Changelog (v1.3.8.3)

  • [FEATURE] Add finetuned models
  • [FEATURE] Add OSC chat types. (scrolling, chunk, full)
  • [FEATURE] Add option to pause speech 2 text processing
  • [TASK] Add length_penalty + beam_search_patience faster-whisper options
  • [BUGFIX] subprocess freeze on full buffer for stdout / err

Full Changelog: v1.3.8.1...v1.3.8.3

v1.3.8.1

26 Jul 11:58
Compare
Choose a tag to compare

Standalone Release File (2.6 GB):

Download Server:

Changelog (v1.3.8.1)

  • [FEATURE] Add Speech 2 Speech Plugin method
  • [FEATURE] Add A.I. Denoiser
  • [FEATURE] Add processmanager to better handle child processes
  • [BUGFIX] Fix error on base64 debug messages
  • [TASK] Add additional Plugin event type
  • [TASK] Update dependencies
  • [TASK] add bfloat16 options

Full Changelog: v1.3.7.2...v1.3.8.1

v1.3.7.2

13 Jul 17:35
Compare
Choose a tag to compare

Standalone Release File (2.6 GB):

Download Server:

Changelog (v1.3.7.2)

  • [FEATURE] add global extract + move_files functions
  • [TASK] Add whisper timestamp options
  • [TASK] Improve download functions
  • [TASK] Update modules

Full Changelog: v1.3.7.1...v1.3.7.2

v1.3.7.1

03 Jul 18:01
Compare
Choose a tag to compare

Standalone Release File (2.6 GB):

Download Server:

Changelog (v1.3.7.1)

  • [TASK] Update PyAudio
  • [TASK] Allow disabling Text translator
  • [BUGFIX] Fix EasyOCR incompatibility with Pillow v10
  • [BUGFIX] Allow precision setting for M2M100
  • [BUGFIX] OSC lang prefix error when using M2M100

Full Changelog: v1.3.6.7...v1.3.7.1

v1.3.6.7

02 Jul 19:34
Compare
Choose a tag to compare

Standalone Release File (2.6 GB):

Download Server:

Changelog (v1.3.6.7)

  • [FEATURE] Add Speech T5 Speech2Text model
  • [BUGFIX] settings yaml creation on energy detection run
  • [BUGFIX][#15] Download possibly missing realtime whisper model
  • [BUGFIX] language listing for single language models
  • [BUGFIX] Fix M2M100 + NLLB200 translators
  • [TASK] Update dependencies

Full Changelog: v1.3.6.6...v1.3.6.7

v1.3.6.6

29 May 16:47
Compare
Choose a tag to compare

Standalone Release File (2.6 GB):

Download Server:

Changelog (v1.3.6.6)

  • [FEATURE] Add support for specific widgets used in plugin settings
  • [FEATURE] Add support for plugin widget events
  • [BUGFIX] fix error on get_plugin_setting if no init_plugin_settings was used.
  • [TASK] Update plugin creation readme
  • [TASK] Update some dependencies

Full Changelog: v1.3.6.5...v1.3.6.6