Custom LangChain tools to search, extract, and transcribe YouTube videos
Install the dependencies with pip install -r requirements.txt
The project consists of two custom LangChain tools, CustomYTSearchTool and CustomYTTranscribeTool:
- CustomYTSearchTool searches YouTube for videos featuring a given person's name and returns the URLs of the top num_results results
- CustomYTTranscribeTool transcribes the videos and writes the text transcripts to transcripts.json
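The hand-off between the two tools can be sketched in plain Python. This is not the code in yt_tools.py: search_videos, transcribe_videos, and the search_fn/transcribe_fn callables are hypothetical stand-ins for the real YouTube search and transcription steps, and the layout of transcripts.json (a mapping from URL to transcript text) is an assumption.

```python
import json
from pathlib import Path
from typing import Callable, Dict, List


def search_videos(person: str, num_results: int,
                  search_fn: Callable[[str], List[str]]) -> List[str]:
    """Return at most num_results URLs for videos featuring `person`.

    search_fn is a hypothetical callable standing in for the real
    YouTube search; it takes a query and returns a ranked list of URLs.
    """
    return search_fn(person)[:num_results]


def transcribe_videos(urls: List[str],
                      transcribe_fn: Callable[[str], str],
                      out_path: str = "transcripts.json") -> Dict[str, str]:
    """Transcribe each URL and write all transcripts to one JSON file.

    transcribe_fn is a hypothetical callable standing in for the real
    download-and-transcribe step. The URL -> text layout is assumed.
    """
    transcripts = {url: transcribe_fn(url) for url in urls}
    Path(out_path).write_text(json.dumps(transcripts, indent=2),
                              encoding="utf-8")
    return transcripts
```

In this sketch the search step only selects URLs and the transcribe step owns the output file, which mirrors the split between the two tools described above.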
Running python yt_tools.py starts an agent run that transcribes the top 3 video results for Laszlo Bock (Google HR celebrity). The results are stored in transcripts.json in the current working directory.
You will be required to authenticate with your YouTube credentials the first time you run it.
Planned improvements:
- Clean up the mp4 files that the Transcribe tool creates between runs
- Add threading/async capability to transcribe videos concurrently (may require not writing all transcripts to a single JSON file)
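One way to approach both items is sketched below, using only the standard library. Everything here is an assumption rather than the repository's code: transcribe_fn is a hypothetical callable standing in for the real transcription step, the one-file-per-video naming (transcript_0.json, transcript_1.json, ...) is invented to avoid concurrent writes to a single JSON file, and the mp4 files are assumed to sit directly in a known directory.

```python
import json
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path
from typing import Callable, List


def transcribe_to_file(index: int, url: str,
                       transcribe_fn: Callable[[str], str],
                       out_dir: Path) -> Path:
    """Transcribe one video and write its transcript to its own file."""
    out_path = out_dir / f"transcript_{index}.json"
    payload = {"url": url, "text": transcribe_fn(url)}
    out_path.write_text(json.dumps(payload, indent=2), encoding="utf-8")
    return out_path


def transcribe_concurrently(urls: List[str],
                            transcribe_fn: Callable[[str], str],
                            out_dir: Path,
                            max_workers: int = 3) -> List[Path]:
    """Run transcriptions in a thread pool, one output file per video."""
    out_dir.mkdir(parents=True, exist_ok=True)
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        futures = [
            pool.submit(transcribe_to_file, i, url, transcribe_fn, out_dir)
            for i, url in enumerate(urls)
        ]
        # Collect results in submission order; .result() re-raises
        # any exception that occurred in a worker thread.
        return [future.result() for future in futures]


def remove_mp4_files(directory: Path) -> List[Path]:
    """Delete every .mp4 file directly inside `directory`."""
    removed = []
    for path in sorted(directory.glob("*.mp4")):
        path.unlink()
        removed.append(path)
    return removed
```

Threads suit this kind of work when the time is dominated by network downloads or calls to an external transcription service. Writing one file per video sidesteps the need for a lock around a shared transcripts.json.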