Downloading AudioCaps

AudioCaps Description

There are 4 columns in the csv files.

audiocap_id: The id unique to the audio clips and its corresponding caption.
youtube_id: The youtube clip that the audio belongs to. You can use this to obtain the VGGish embedding from AudioSet.
start_time: The start time of the clip.
caption: The audio caption.

Last edit: Jan 30, 2023

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
data		data
LICENSE		LICENSE
README.md		README.md
download.py		download.py
errors.py		errors.py
requirements.txt		requirements.txt