An intelligent system that automatically generates engaging podcast conversations using LLMs and text-to-speech technology.
- Two modes of operation:
- Research mode: Automated research and content gathering using Tavily search
- Context mode: Generate podcasts from provided source materials (URLs and files)
- Dynamic podcast outline generation
- Natural conversational script writing with multiple Q&A rounds
- High-quality text-to-speech synthesis using Google Cloud or ElevenLabs
- Checkpoint system to save progress and resume generation
- Configurable voices and audio settings
- Gradio UI
Listen to sample podcasts generated using Podcast-LLM:
-
Install using pip:
pip install podcast-llm
-
Set up environment variables in
.env
:OPENAI_API_KEY=your_openai_key GOOGLE_API_KEY=your_google_key ELEVENLABS_API_KEY=your_elevenlabs_key TAVILY_API_KEY=your_tavily_key ANTHROPIC_API_KEY=your_anthropic_api_key
-
Generate a podcast about a topic:
# Research mode (default) - automatically researches the topic podcast-llm "Artificial Intelligence" # Context mode - uses provided sources podcast-llm "Machine Learning" --mode context --sources paper.pdf https://example.com/article
-
Options:
# Customize number of Q&A rounds per section podcast-llm "Linux" --qa-rounds 3 # Disable checkpointing podcast-llm "Space Exploration" --checkpoint false # Generate audio output podcast-llm "Quantum Computing" --audio-output podcast.mp3 # Generate Markdown output podcast-llm "Machine Learning" --text-output podcast.md
-
Customize voices and other settings in
config/config.yaml
-
Launch the Gradio web interface:
# Start the web UI podcast-llm-gui
This launches a user-friendly web interface where you can:
- Enter a podcast topic
- Choose between research and context modes
- Upload source files and URLs for context mode
- Configure Q&A rounds and checkpointing
- Specify output paths for text and audio
- Monitor generation progress in real-time
This project is licensed under Creative Commons Attribution-NonCommercial 4.0 International (CC BY-NC 4.0)
This means you are free to:
- Share: Copy and redistribute the material in any medium or format
- Adapt: Remix, transform, and build upon the material
Under the following terms:
- Attribution: You must give appropriate credit, provide a link to the license, and indicate if changes were made
- NonCommercial: You may not use the material for commercial purposes
- No additional restrictions: You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits
For commercial use, please contact [email protected] to obtain a commercial license.
The full license text can be found at: https://creativecommons.org/licenses/by-nc/4.0/legalcode
This project was inspired by podcastfy, which provides a framework for generating podcasts using LLMs.
This implementation differs by automating the research and content gathering process, allowing for fully autonomous podcast generation about any topic without requiring manual research or content curation.