Skip to content

v0.2.4-beta

Compare
Choose a tag to compare
@t41372 t41372 released this 15 Oct 04:21
· 102 commits to main since this release
99d5ad1

Release Notes - Version 0.2.4

It's been a week since the last release, so here is a new release.

What's New

  • Feature: xTTSv2 TTS Engine Support
    Added support for the xtts-api-server, which now integrates with the xTTSv2 text-to-speech engine. Thanks to @Eggze2 for contributing! #23

  • Feature: Environment Variable Support in conf.yaml
    You can now reference environment variables directly in the conf.yaml file using the ${ENV_VAR_NAME} syntax. This eliminates the need for explicit values in the configuration file by dynamically loading them from the environment.

Bug Fixes

  • Hands-Free Voice Interactions (CLI)
    Restored hands-free voice interactions in the CLI, which were previously asking for key presses at the end of each conversation turn. This was not intended, and the functionality is now working as expected.

Improvements

  • CLI Interruption Stability
    Improved the stability of interruptions in CLI mode. The system now knows which sentence was interrupted, preventing sentences it didn't had a chance to say from being stored in the LLM's memory. This behavior is now consistent with the Live2D mode.

Changes

  • Randomized Cache Audio Filenames
    Cached audio files are now named with random UUIDs instead of sequential names like temp-1, improving uniqueness and preventing potential naming conflicts.

  • PortAudio Dependency Update
    PortAudio is no longer required if the local microphone is not in use (e.g., when running in a headless container with no mic). Previously, the program would throw an error even if all we need is a web server and a local microphone wasn't necessary. Now, it only throws an error if microphone functionality is explicitly needed locally (e.g. when running the main.py).

New Contributors

A big thanks to our newest contributor:

Full Changelog: Compare v0.2.3...v0.2.4


This release note was enhanced with GPT-4o, which is why it sounds so professional.