This is an AI Assistant application that recognizes speech, interacts with users, and replies with text-to-speech responses. It uses Google's Gemini-Pro model to generate text responses.
- Speech recognition using the Google Speech Recognition API.
- Interaction with the Gemini model for generating responses.
- Text-to-speech conversion using the gTTS (Google Text-to-Speech) library.
- Download the ZIP file containing the packaged executable.
- Unzip the file.
- Run the main.exe file. It will open a console/terminal window.
- Clone this repository to your system.
- `cd` into the repository files.
- Run this: `./build/exe.linux-x86_64-3.10/main`
- Run the unzipped executable or the `main.py` script.
- When the application is running, speak to the AI Assistant. The Assistant will send your speech to the Gemini model to generate a response.
- The generated text response will be displayed, and the AI Assistant will also convert it to speech and play it.
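
The listen, respond, and speak flow described above can be sketched roughly as follows. This is a minimal illustration, not the code in `main.py`: it assumes the `SpeechRecognition`, `google-generativeai`, and `gTTS` packages are installed, the function names (`listen_once`, `ask_gemini`, `speak`, `run_turn`) and the `GOOGLE_API_KEY` environment variable are hypothetical, and audio playback is left out.

```python
import os
from typing import Callable, Optional


def listen_once() -> Optional[str]:
    """Capture one utterance from the microphone and return it as text."""
    import speech_recognition as sr  # assumption: SpeechRecognition (plus PyAudio)

    recognizer = sr.Recognizer()
    with sr.Microphone() as source:
        audio = recognizer.listen(source)
    try:
        # Sends the audio to the Google Speech Recognition API.
        return recognizer.recognize_google(audio)
    except sr.UnknownValueError:
        return None  # the speech could not be understood


def ask_gemini(prompt: str) -> str:
    """Send the recognized text to the Gemini model and return its reply."""
    import google.generativeai as genai  # assumption: google-generativeai

    # Hypothetical variable name; the project may load its key differently.
    genai.configure(api_key=os.environ["GOOGLE_API_KEY"])
    model = genai.GenerativeModel("gemini-pro")
    return model.generate_content(prompt).text


def speak(text: str, path: str = "response.mp3") -> str:
    """Convert text to speech with gTTS and save it as an MP3 file."""
    from gtts import gTTS

    gTTS(text=text, lang="en").save(path)
    return path  # playing the file is left to the caller


def run_turn(
    listen: Callable[[], Optional[str]],
    respond: Callable[[str], str],
    speak_fn: Callable[[str], object],
    show: Callable[[str], object] = print,
) -> Optional[str]:
    """Run one turn: listen, generate a reply, display it, then speak it."""
    heard = listen()
    if not heard:
        return None  # nothing recognized, so skip this turn
    reply = respond(heard)
    show(reply)
    speak_fn(reply)
    return reply
```

With the dependencies installed, `run_turn(listen_once, ask_gemini, speak)` would perform a single turn. Passing the three steps in as functions keeps the loop easy to try out with stand-ins, without a microphone or an API key.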
You can customize the behavior of the AI Assistant by modifying the code in `main.py` and the options in `setup.py`.
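
As an illustration of what build options might look like, here is a minimal sketch of a `setup.py`. It assumes the executable is built with cx_Freeze, which the `build/exe.linux-x86_64-3.10` output path suggests but the project does not state. The project name, version, and package list below are hypothetical placeholders, not the project's actual values.

```python
import sys


def build_exe_options() -> dict:
    """Options passed to the cx_Freeze build_exe command."""
    return {
        # Assumed runtime dependencies to bundle into the executable.
        "packages": ["speech_recognition", "gtts"],
        # Modules left out to keep the build smaller (illustrative choice).
        "excludes": ["tkinter"],
        # Extra data files to copy next to the executable, if any.
        "include_files": [],
    }


def main() -> None:
    from cx_Freeze import Executable, setup  # assumption: cx_Freeze is the build tool

    setup(
        name="ai-assistant",  # hypothetical name
        version="0.1.0",  # hypothetical version
        description="Voice-driven AI Assistant",
        options={"build_exe": build_exe_options()},
        executables=[Executable("main.py")],
    )


# Run the build only when a command such as `build` is given on the command line.
if __name__ == "__main__" and len(sys.argv) > 1:
    main()
```

Under these assumptions, running `python setup.py build` would regenerate the executable under `build/`, with the exact directory name depending on the platform and Python version.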
If you'd like to contribute to this project, please follow these steps:
- Fork the repository.
- Create a new branch for your feature or bug fix.
- Make your changes.
- Test your changes to ensure they work as expected.
- Submit a pull request with your changes.
This project is licensed under the MIT License. You are free to use, modify, and distribute it as you see fit.