Add PDF to Audio and Audio to Audio #37

Josh-XT · 2024-04-06T22:47:31Z

Add PDF to Audio and Audio to Audio

Sending a pdf to the /v1/audio/speech endpoint will return audio of the full PDF.
Sending audio to the /v1/audio/speech endpoint will return audio in the voice selected.

import openai

openai.base_url = "http://localhost:8091/v1/"
openai.api_key = "your api key"
pdf_path = "C:\\book.pdf"
with open(pdf_path, "rb") as file:
    base64_encoded_pdf = base64.b64encode(file.read()).decode("utf-8")
base64_output = f"data:application/pdf;base64,{base64_encoded_pdf}"
# If it is an audio file, it would be data:audio/wav;base64,.......
tts_response = openai.audio.speech.create(
    model="tts-1",
    voice="Morgan_Freeman",
    input=base64_output,
    user="Title of audio",
)
# tts_response will be a URL with the audio. Depending on size of PDF, this will take awhile.
print(tts_response)

Josh-XT added 2 commits April 6, 2024 18:46

Add PDF to Audio

46f9d74

Add audio to audio

4581f2f

Josh-XT changed the title ~~Add PDF to Audio~~ Add PDF to Audio and Audio to Audio Apr 6, 2024

Josh-XT added 3 commits April 6, 2024 19:31

Automated chunking

b6deacf

Set to 700 length

b965708

Clean up

3d244ff

Josh-XT merged commit 0f6bd7c into main Apr 7, 2024
2 checks passed

Josh-XT deleted the pdf-to-audio branch April 7, 2024 00:07

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add PDF to Audio and Audio to Audio #37

Add PDF to Audio and Audio to Audio #37

Josh-XT commented Apr 6, 2024 •

edited

Loading

Add PDF to Audio and Audio to Audio #37

Add PDF to Audio and Audio to Audio #37

Conversation

Josh-XT commented Apr 6, 2024 • edited Loading

Josh-XT commented Apr 6, 2024 •

edited

Loading