Podify Studio is a Python application for generating podcasts by merging text-based dialogs into audio files. It provides functionalities to convert text to speech using different text-to-speech (TTS) engines, merge audio files, and generate podcasts seamlessly.
- Convert text to speech using gTTS (Google Text-to-Speech) and pyttsx3.
- Generate multiple audio files from text-based dialogs.
- Merge audio files to create a podcast.
-
Clone the repository:
git clone https://github.com/AniruddhaGawali/podify-studio
-
Navigate to the project directory:
cd podify-studio
-
Install the dependencies:
pip install -r requirements.txt
-
Install
You can open and save WAV files with pure python. For opening and saving non-wav files – like mp3 – you'll need ffmpeg or libav. (for pydub)
-
Prepare your text-based dialogs in a file, with each dialog formatted as
<name>:<text>
, separated by a delimiter (default is ';'). For example:Speaker 1: Hello, how are you?; Speaker 2: I'm fine, thank you!; Speaker 1: That's great!;
-
Use the provided Python scripts to generate audio files and merge them into a podcast:
# Convert text to audio files using gTTS python text_to_audio_gtt.py --input dialogs.txt --output_folder output # Merge audio files into a podcast python generate_podcast.py --input_folder output --output podcast.mp3