Urdu speech to image scene generator using numpy, pandas, and pandas3d visualizer library.
This project is a Python-based system that converts Urdu speech to a 3D scene. It includes the following functionalities:
- Online speech recognition using Google's Speech Recognition API
- Converting the Urdu script to Roman script for processing
- Preprocessing the text data, such as replacing similar words and pronouns
- Generating a 3D scene based on the processed text data, using Panda3D
- Python 3.6+
- pip package installer
- Sounddevice, scipy, wavio, spacy, pandas, numpy, and other Python libraries (refer to requirements.txt)
- Clone the repository to your local machine using Git:
git clone https://github.com/<username>/<repository>.git
- Change the directory to the cloned repository:
cd <repository>
- Install the required packages using pip:
pip install -r requirements.txt
- Run the following command to start the system:
python main.py
- Follow the on-screen instructions to speak the Urdu text into the microphone. (5-second timer, start from "hey")
- The system will process the spoken text and generate a 3D scene based on the processed data.
If you have any suggestions, please open an issue on the GitHub repository or create a pull request.