t46/video-dataset

video-dataset

Quick Start

Download HD-VILA-100M Dataset

Extract URLs and Save as CSV

cd preprocess
python extract_url.py
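
The contents of `extract_url.py` are not shown in this README. As a rough sketch of what this step might involve, the helper below assumes the downloaded metadata is in JSON Lines form with one record per line and a `url` field in each record. That layout and the name `write_url_csv` are assumptions for illustration. The only detail taken from this README is the `url` column name, which the next step passes as `--url_col`.

```python
import csv
import json
from typing import Iterable, TextIO


def write_url_csv(jsonl_lines: Iterable[str], out: TextIO, url_key: str = "url") -> int:
    """Write the unique URLs found in JSON Lines records to a one-column CSV.

    Hypothetical helper: the record layout (one JSON object per line with a
    `url` field) is an assumption, not taken from extract_url.py.
    Returns the number of URLs written.
    """
    writer = csv.writer(out)
    writer.writerow(["url"])  # header matches --url_col="url" in the next step
    seen = set()
    for line in jsonl_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        url = json.loads(line).get(url_key)
        if url and url not in seen:  # drop records without a URL and duplicates
            seen.add(url)
            writer.writerow([url])
    return len(seen)
```

To write a real file, open the output with `open(path, "w", newline="")`, as the `csv` module recommends, and pass the open metadata file as `jsonl_lines`.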

Get Videos and Metadata with video2dataset

video2dataset \
--url_list="results/all_urls.csv" \
--url_col="url" \
--output_folder="dataset"
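
Before starting a long download, it may be worth confirming that the CSV really has the column named by `--url_col`. The sketch below does that check and builds the same command as an argument list for `subprocess`. The function names `has_column` and `video2dataset_cmd` are made up for this example, and only the three flags shown above are used.

```python
import csv
from typing import Iterable, List


def has_column(csv_lines: Iterable[str], column: str) -> bool:
    """Return True if the CSV header row contains `column`."""
    header = next(csv.reader(csv_lines), [])  # empty input gives an empty header
    return column in header


def video2dataset_cmd(url_list: str, url_col: str, output_folder: str) -> List[str]:
    """Build the argv list for the video2dataset call shown in this README."""
    return [
        "video2dataset",
        f"--url_list={url_list}",
        f"--url_col={url_col}",
        f"--output_folder={output_folder}",
    ]
```

A caller could open `results/all_urls.csv`, pass the file object to `has_column(f, "url")`, and only then hand the result of `video2dataset_cmd(...)` to `subprocess.run(..., check=True)`.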

Convert Videos to 1 fps

cd preprocess
./change_fps.sh
./copy_json.sh
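
Neither shell script is reproduced here. One common way to resample a video to 1 fps is ffmpeg's `fps` video filter, so the sketch below builds such a command and derives the path of a same-named JSON metadata file. Both the use of ffmpeg and the assumption that each video has a `.json` file next to it are guesses, not facts about `change_fps.sh` or `copy_json.sh`. The helper names are hypothetical.

```python
from pathlib import Path
from typing import List


def ffmpeg_fps_cmd(src: Path, dst: Path, fps: int = 1) -> List[str]:
    """Build an ffmpeg command that re-encodes `src` at `fps` frames per second.

    Assumption: ffmpeg is the tool used; the scripts in this repo may differ.
    """
    return ["ffmpeg", "-i", str(src), "-vf", f"fps={fps}", str(dst)]


def sidecar_json(video: Path) -> Path:
    """Return the path of the JSON file assumed to sit next to `video`."""
    return video.with_suffix(".json")
```

The resulting list can be passed to `subprocess.run(cmd, check=True)` for each video, and `shutil.copy` can carry the JSON file to the output folder.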

Split Videos and Subtitles and Save Them as a StreamingDataset

Option 1

cd preprocess
python video_to_mds.py

Option 2

cd preprocess
python video_to_mds_pre_split.py
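
Neither script is shown in this README, so the following is only a sketch of the general idea: cut a video's timeline into fixed-length windows, collect the subtitle text that overlaps each window, and write the samples with `MDSWriter` from the `streaming` package. The window length, the cue format `(start, end, text)`, the column schema, and all helper names are assumptions for illustration.

```python
from typing import Dict, Iterable, List, Sequence, Tuple

Span = Tuple[float, float]
Cue = Tuple[float, float, str]  # assumed subtitle format: (start, end, text)


def split_windows(duration: float, window: float) -> List[Span]:
    """Cut [0, duration) into consecutive windows of at most `window` seconds."""
    if window <= 0:
        raise ValueError("window must be positive")
    spans = []
    start = 0.0
    while start < duration:
        spans.append((start, min(start + window, duration)))
        start += window
    return spans


def subtitles_in(span: Span, cues: Sequence[Cue]) -> str:
    """Join the text of every cue that overlaps `span`."""
    lo, hi = span
    return " ".join(text for s, e, text in cues if s < hi and e > lo)


def write_mds(samples: Iterable[Dict], out_dir: str) -> None:
    """Write samples shaped like {"video": bytes, "subtitle": str} as MDS shards."""
    # Imported lazily so the helpers above work without the `streaming` package.
    from streaming import MDSWriter

    columns = {"video": "bytes", "subtitle": "str"}  # assumed schema
    with MDSWriter(out=out_dir, columns=columns) as writer:
        for sample in samples:
            writer.write(sample)
```

A cue that straddles a window boundary lands in both windows here. Whether the repository's scripts behave the same way is not stated.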
