Skip to content

Latest commit

 

History

History
82 lines (60 loc) · 3.65 KB

CHANGELOG.md

File metadata and controls

82 lines (60 loc) · 3.65 KB

Changelog

[0.4.0] - 2024-11-16

Added

  • Add Google Singlespeaker (Journey) and Multispeaker TTS models
  • Fixed limitations of Google Multispeaker TTS model: 5000 bytes input limite and 500 bytes per turn limit.
  • Updated tests and docs accordingly

[0.3.6] - 2024-11-13

Added

  • Add longform podcast generation support
    • Users can now generate longer podcasts (20-30+ minutes) using the --longform flag in CLI or longform=True in Python API
    • Implements "Content Chunking with Contextual Linking" technique for coherent long-form content
    • Configurable via max_num_chunks and min_chunk_size parameters in conversation config
    • word_count parameter removed from conversation config as it's no longer used

[0.3.3] - 2024-11-08

Breaking Changes

  • Loading images from 'path' has been removed for security reasons. Please specify images by passing an 'url'.

Added

  • Add podcast generation from topic "Latest News in U.S. Politics"
  • Integrate with 100+ LLM models (OpenAI, Anthropic, Google etc) for transcript generation
  • Integrate with Google's Multispeaker TTS model for high-quality audio generation
  • Deploy REST API with FastAPI
  • Support for raw text as input
  • Add PRIVACY_POLICY.md
  • Start TESTIMONIALS.md
  • Add apps using Podcastfy to README.md

Fixed

  • #165 Fixed audio generation in Windows OS issue: Normalize path separators for cross-platform compatibility

[0.2.3] - 2024-10-15

Added

  • Add local llm option by @souzatharsis
  • Enable running podcastfy with no API KEYs thanks to solving #18 #58 #65 by @souzatharsis and @ChinoUkaegbu
  • Add user-provided TSS config such as voices #10 #6 #27 by @souzatharsis
  • Add open in collab and setting python version to 3.11 by @Devparihar5 #57
  • Add edge tts support by @ChinoUkaegbu
  • Update pypdf with pymupdf(10x faster then pypdf) #56 check by @Devparihar5
  • Replace r.jina.ai with simple BeautifulSoap #18 by @souzatharsis

Fixed

  • Fixed CLI for user-provided config #69 @souzatharsis

[0.2.2] - 2024-10-13

Added

Fixed

  • (#52) Fixed simple bug introduced in 0.2.1 that broke the ability to generate podcasts from text inputs!
  • Fixed one example in the documentation that was not working.

[0.2.1] - 2024-10-12

Added

  • (#8) Podcastfy is now multi-modal! Users can now generate audio from images by simply providing the paths to the image files.

Fixed

  • (#40) Updated default ElevenLabs voice from BrittneyHart to Jessica. The latter was a non-default voice I used from my account, which caused error for users who don't have it.

[0.2.0] - 2024-10-10

Added

  • Parameterized podcast generation with Conversation Configuration (#11, #3, #4)
  • LangChain integration for improved LLM interface and observability (#29)
  • Changelog to track version updates (#22)
  • Tests for Customized conversation scenarios

Fixed

  • CLI now correctly reads from user-provided local .env file (#37)