1.0.99996 | Alpha 4 for 1.1 | Current: Version 1.1-A4 #114
Pinned
cyberofficial
announced in
Announcements
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Portable + GUI is at itch.io! Pick it up there.
Version 1.1-A4
Full Fixes located at #112
What's Changed
Changelog
Remote Microphone Server (Start of 1.1)
Ignore sending data to API if data is empty
Journey to 1.1
GUI Wrapper Update
Fixed minor error
Fix Win 2 Error For Portable Version [temporary]
Remote Microphone update and Stream Module Update
Improvements:
download_segment
function instream_transcription_module.py
to include more robust error handling and retries for downloading segments.max_retries
parameter to thedownload_segment
function to specify the number of retry attempts.requests.exceptions.RequestException
specifically for network-related errors, providing more informative error messages.retry_delay
to handle temporary network issues.http.client.IncompleteRead
exceptions, which can occur if the connection drops during download.load_m3u8_with_retry
function instream_transcription_module.py
has been enhanced to retry loading the M3U8 playlist file if there are errors, making it more resilient to network fluctuations.requests.exceptions.RequestException
andhttp.client.IncompleteRead
errors during playlist loading, retrying with a delay until successful.sounddevice
,soundfile
, andpydub
to requirements.txt for audio capture. Will be using a more native way to capture audio better.player.html
to address potential overlapping issues. This change ensures better visual clarity and prevents elements from overlapping, enhancing the user interface.Bug Fixes:
stream_transcription_module.py
:remote_microphone.py
:remote_microphone.py
. This change ensures that the generated HLS playlist is created correctly, addressing potential issues with live stream playback.Lock numpy to version 1.26.4
numpy
to version 1.26.4Update README.md
Version Bump
Additional Fixes
Full Changelog: 1.0.99995a3...1.0.99995a4
Version 1.1-A3
A2 -> A3
Synthalingua Change Log: Version 1.0.99995
This update focuses on improving the streaming functionality and user experience.
New Features:
--remote_hls_password_id
and--remote_hls_password
flags.Bug Fixes:
Other Changes:
Please note: This changelog highlights the key changes in version 1.0.99995. For a comprehensive list of all changes, please refer to the commit history on the GitHub repository.
ℹ️ Important information:
The streaming server for the microphone will record 30 1 second chunks. So if you want a 6 second record time, set to 6 chunks, or 3 seconds of audio set to 3 chunks. 1 Chunk = 1 Second
Synthalingua Remote Microphone Streaming and HLS Password Support
This update introduces two major features to Synthalingua: remote microphone streaming and support for password-protected HLS streams.
Remote Microphone Streaming
You can now stream audio from your microphone to a web server, enabling remote access for translation and transcription. This is particularly useful when:
To use remote microphone streaming:
Run the
remote_microphone.py
script on the machine with the microphone input or virtual audio input. This script creates a local web server that streams the microphone audio as an HLS stream. You will be prompted to select the microphone and choose a server port and stream key. For instance, if the script generates the stream key "your_secret_key", the HLS playlist URL would be:On the machine running Synthalingua, use the
--stream
flag with the HLS playlist URL:Remember to replace "your_secret_key" with the actual stream key generated by the
remote_microphone.py
script.Add the
--remote_hls_password_id
and--remote_hls_password
flags to provide the stream key ID and key and othe relevant arguments:This ensures that only authorized users with the correct stream key can access the audio stream. Which is hard coded enabled, there will be not option to disable this. Security first! This command does the following, Sets the VRAM usage to 2GB, Will show original text, Identify the input language as English so we don't need to transcribe or translate, sets the stream chunk to 6 (6 second audio recording time), using cuda (aka gpu) using a custom black list of phrases, Sets the webserver of Synthalingua to 2000 on the local machine, condition_on_previous_text will try to prevent spammed repeats of words, hls id is key, password is the password after the = sign.
HLS Stream Password Support
Synthalingua now supports HLS streams that require a password through url authentication, allowing you to access and translate/transcribe content from premium or protected sources that use hls passwords via direct url. Please remember not all HLS Password streams will not work like this, but this is still early alpha and will be improved upon later.
To use HLS stream password support also comes along the microphone server:
Obtain the HLS URL and password information. This might involve inspecting network requests or utilizing browser developer tools to identify the HLS playlist URL and any required authentication parameters.
Use the
--stream
flag with the HLS URL:Include the
--remote_hls_password_id
and--remote_hls_password
flags to provide the password ID and password:Replace "your_id" and "your_password" with the actual values required for authentication. The specific format and values for these parameters will vary depending on the stream provider.
Important Notes:
--remote_hls_password_id
and--remote_hls_password
flags when connecting to the microphone stream.remote_microphone.py
script is "key".I hope these new features enhance your experience with Synthalingua! Please feel free to report any issues or provide feedback on the GitHub repository.
This discussion was created from the release 1.0.99996 | Alpha 4 for 1.1 | Current: Version 1.1-A4.
Beta Was this translation helpful? Give feedback.
All reactions