Podcast-LLM: AI-Powered Podcast Generation

An intelligent system that automatically generates engaging podcast conversations using LLMs and text-to-speech technology.

View Documentation

Features

- Two modes of operation:
  - Research mode: Automated research and content gathering using Tavily search
  - Context mode: Generate podcasts from provided source materials (URLs and files)
- Dynamic podcast outline generation
- Natural conversational script writing with multiple Q&A rounds
- High-quality text-to-speech synthesis using Google Cloud or ElevenLabs
- Checkpoint system to save progress and resume generation
- Configurable voices and audio settings
- Gradio UI

Examples

Listen to sample podcasts generated using Podcast-LLM:

- Structured JSON Output from LLMs (Google multispeaker voices)
- UFO Crash Retrieval (ElevenLabs voices)
- The Behenian Fixed Stars (Google multispeaker voices)
- Podcast-LLM Overview (Google multispeaker voices)
- Robotic Process Automation (Google voices)

Web Interface

Installation

Install using pip:
```
pip install podcast-llm
```

Set up environment variables in .env:

```
OPENAI_API_KEY=your_openai_key
GOOGLE_API_KEY=your_google_key
ELEVENLABS_API_KEY=your_elevenlabs_key
TAVILY_API_KEY=your_tavily_key
ANTHROPIC_API_KEY=your_anthropic_api_key
```
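
Before starting a generation run, it can help to confirm that the keys listed in `.env` are actually filled in. The following is a minimal sketch, not part of Podcast-LLM: the helper names `parse_env` and `missing_keys` are hypothetical, and treating all five keys as required is an assumption, since a given run may only need the keys for the providers it actually uses.

```python
from pathlib import Path

# Key names taken from the .env example in this README. Treating all of
# them as required is an assumption; a given run may need only a subset.
EXPECTED_KEYS = [
    "OPENAI_API_KEY",
    "GOOGLE_API_KEY",
    "ELEVENLABS_API_KEY",
    "TAVILY_API_KEY",
    "ANTHROPIC_API_KEY",
]


def parse_env(text: str) -> dict[str, str]:
    """Parse simple KEY=value lines, skipping blank lines and # comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(text: str, expected=EXPECTED_KEYS) -> list[str]:
    """Return the expected keys that are absent or empty in the .env text."""
    values = parse_env(text)
    return [key for key in expected if not values.get(key)]


if __name__ == "__main__":
    env_path = Path(".env")
    text = env_path.read_text() if env_path.exists() else ""
    for key in missing_keys(text):
        print(f"Missing or empty: {key}")
```

Run from the directory that contains `.env`, the script prints one line per key that is missing or left blank and prints nothing when every key has a value.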

Usage

Generate a podcast about a topic:

```
# Research mode (default) - automatically researches the topic
podcast-llm "Artificial Intelligence"

# Context mode - uses provided sources
podcast-llm "Machine Learning" --mode context --sources paper.pdf https://example.com/article
```

Options:

```
# Customize number of Q&A rounds per section
podcast-llm "Linux" --qa-rounds 3

# Disable checkpointing
podcast-llm "Space Exploration" --checkpoint false

# Generate audio output
podcast-llm "Quantum Computing" --audio-output podcast.mp3

# Generate Markdown output
podcast-llm "Machine Learning" --text-output podcast.md
```
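
When generation is driven from a script, the options shown in this section can be assembled into a single command line. The following is a sketch under stated assumptions: `build_command` is a hypothetical helper, not part of Podcast-LLM, and it uses only the flags that appear in this README. It assumes the flags can be combined in one invocation, which the examples here do not confirm.

```python
import shlex


def build_command(topic, mode=None, sources=(), qa_rounds=None,
                  checkpoint=True, audio_output=None, text_output=None):
    """Build an argv list for the podcast-llm CLI (hypothetical helper)."""
    cmd = ["podcast-llm", topic]
    if mode is not None:
        cmd += ["--mode", mode]
    if sources:
        cmd += ["--sources", *sources]
    if qa_rounds is not None:
        cmd += ["--qa-rounds", str(qa_rounds)]
    if not checkpoint:
        # Only the "false" form appears in the README, so the flag is
        # omitted entirely when checkpointing stays enabled.
        cmd += ["--checkpoint", "false"]
    if audio_output is not None:
        cmd += ["--audio-output", audio_output]
    if text_output is not None:
        cmd += ["--text-output", text_output]
    return cmd


if __name__ == "__main__":
    cmd = build_command(
        "Machine Learning",
        mode="context",
        sources=["paper.pdf", "https://example.com/article"],
        qa_rounds=3,
        text_output="podcast.md",
    )
    # Print the shell-quoted command for inspection instead of running it.
    print(shlex.join(cmd))
    # To execute it, podcast-llm must be installed and the API keys set:
    # import subprocess; subprocess.run(cmd, check=True)
```

Building the command as a list rather than a single string keeps topics that contain spaces intact without manual quoting.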

Customize voices and other settings in config/config.yaml.

Launch the Gradio web interface:
```
# Start the web UI
podcast-llm-gui
```
This launches a user-friendly web interface where you can:
- Enter a podcast topic
- Choose between research and context modes
- Upload source files and URLs for context mode
- Configure Q&A rounds and checkpointing
- Specify output paths for text and audio
- Monitor generation progress in real-time

License

This project is licensed under the Creative Commons Attribution-NonCommercial 4.0 International license (CC BY-NC 4.0).

This means you are free to:

- Share: Copy and redistribute the material in any medium or format
- Adapt: Remix, transform, and build upon the material

Under the following terms:

- Attribution: You must give appropriate credit, provide a link to the license, and indicate if changes were made
- NonCommercial: You may not use the material for commercial purposes
- No additional restrictions: You may not apply legal terms or technological measures that legally restrict others from doing anything the license permits

For commercial use, please contact [email protected] to obtain a commercial license.

The full license text can be found at: https://creativecommons.org/licenses/by-nc/4.0/legalcode

Acknowledgements

This project was inspired by podcastfy, which provides a framework for generating podcasts using LLMs.

This implementation differs by automating the research and content gathering process, allowing for fully autonomous podcast generation about any topic without requiring manual research or content curation.
