thowitz/youtube-transcript-searcher

YouTube transcript searcher

About

  • This is a small CLI program that searches the transcripts of videos in a YouTube channel for a specific term.
  • I stand on the shoulders of giants: three amazing open source projects did most of the work, and I just mushed them together. Check out the acknowledgements and give them some love :)

Usage

One liner

  • -t {search term}
  • -u {channel username}
  • -n {number of videos to search}

Prompt

  • Any option you omit will be prompted for
  • To be prompted for everything, just run the script without any options
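
The option-or-prompt behaviour described above can be sketched with the standard library alone. This is only an illustration: the project itself uses Typer, whereas this sketch swaps in `argparse` and `input()` so it runs without extra dependencies. The function name `parse_options` and the prompt wording are assumptions, not taken from the script.

```python
import argparse


def parse_options(argv, ask=input):
    """Read -t, -u and -n from argv, prompting for any that are missing.

    `ask` is a hypothetical hook (defaults to input) so the prompts
    can be replaced when the function is used non-interactively.
    """
    parser = argparse.ArgumentParser(description="Search a channel's transcripts")
    parser.add_argument("-t", dest="term", help="search term")
    parser.add_argument("-u", dest="username", help="channel username")
    parser.add_argument("-n", dest="number", type=int,
                        help="number of videos to search")
    args = parser.parse_args(argv)

    # Any option left out on the command line is asked for interactively.
    term = args.term if args.term is not None else ask("Search term: ")
    username = (args.username if args.username is not None
                else ask("Channel username: "))
    number = (args.number if args.number is not None
              else int(ask("Number of videos to search: ")))
    return term, username, number
```

Calling `parse_options(sys.argv[1:])` with all three options supplied returns immediately; leaving any of them out triggers the matching prompt.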

How it works

  1. The scrapetube library is used to get a list of video ids from the channel; the length of the list is the number of videos to search
  2. The transcript for each video id in the list is then retrieved using the youtube transcript api library
  3. Each transcript is checked for the search term and any occurrences are stored
  4. At the end, the id of each video containing an occurrence is printed, along with the text and start position of every occurrence
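
Steps 3 and 4 can be sketched in plain Python. This is a rough illustration rather than the project's actual code: it assumes each transcript is a list of entries with `text` and `start` keys, and that matching is case-insensitive, neither of which is confirmed by this README. Steps 1 and 2 are left out because they need network access and the two libraries; the function names here are hypothetical.

```python
def find_occurrences(transcript, term):
    """Return (start, text) for every transcript entry containing term."""
    needle = term.lower()  # assumption: the search ignores case
    return [
        (entry["start"], entry["text"])
        for entry in transcript
        if needle in entry["text"].lower()
    ]


def search_transcripts(transcripts, term):
    """Map each video id to its occurrences, skipping videos with none."""
    results = {}
    for video_id, transcript in transcripts.items():
        occurrences = find_occurrences(transcript, term)
        if occurrences:
            results[video_id] = occurrences
    return results


def print_results(results):
    """Print each matching video id, then its occurrences with start positions."""
    for video_id, occurrences in results.items():
        print(video_id)
        for start, text in occurrences:
            print(f"  {start}: {text}")
```

Here `transcripts` would be a dict built in steps 1 and 2, keyed by video id, so `print_results(search_transcripts(transcripts, term))` covers the final two steps.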

Languages

Currently, only English is supported. If there's enough interest, I might add translation/support for other languages.

Acknowledgements

YouTube libraries

These libraries were essential to the project, allowing me to get all the data I needed without having to use Selenium or the annoying YouTube API

CLI

  • Typer is a brilliant library for easily creating CLIs; I was able to add all the user interaction I wanted in like 5 minutes
