
Is the full_scale video data (5TB) needed for the VQ2D task? #3

fcakyon opened this issue Mar 27, 2022 · 2 comments

fcakyon commented Mar 27, 2022

Thanks for this wonderful work!

How can I reduce the download size if I only want to work on the VQ2D task?

The command given here downloads more than 5 TB of data: https://github.com/EGO4D/episodic-memory/blob/main/VQ2D/README.md#running-experiments

miguelmartin75 (Collaborator) commented Mar 28, 2022

You should be able to download just the subset required for EM VQ by providing the --benchmarks em flag to the CLI (see here). This will still download more videos than necessary (since it covers the entire EM benchmark), sitting at 2.75 TB, e.g.

python3 -m ego4d.cli.cli --output_directory=<dir> --dataset full_scale --benchmark vq

If you want to download less, I would recommend passing in the video uids to download via --video_uid_file, with the video uids derived from the annotation JSON files.
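
For example, here is a minimal sketch of building such a file from the VQ annotation JSON (the field names "videos" and "video_uid", the file names, and the one-uid-per-line format are assumptions; adjust them to the actual annotation schema):

import json

# Hypothetical sketch: collect the video uids referenced by the VQ
# annotations and write them out for use with --video_uid_file.
# Assumes the annotation JSON has a top-level "videos" list whose
# entries each carry a "video_uid" field.
with open("vq_train.json") as f:
    annotations = json.load(f)

video_uids = sorted({v["video_uid"] for v in annotations["videos"]})

# Assumes --video_uid_file expects one uid per line.
with open("vq_video_uids.txt", "w") as f:
    f.write("\n".join(video_uids))

print(f"Wrote {len(video_uids)} video uids")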

There are also canonical clips. These are clips specific to each benchmark task and are subsets of the full videos. For VQ they are ~5 FPS clips containing only the frames where there are annotations. They are much smaller, sitting at around 700 GB (for all of EM).
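
If the canonical clips are sufficient for your use case, the download would follow the same CLI pattern as above, e.g. (a sketch assuming the canonical clips are exposed as a dataset named clips; check the CLI documentation for the exact dataset name):

python3 -m ego4d.cli.cli --output_directory=<dir> --dataset clips --benchmark vq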

cc @ebyrne

fcakyon (Author) commented Apr 4, 2022

@miguelmartin75 thanks for the response! What is the purpose of the canonical clips? Should I use them to train my proposed model, or would that result in suboptimal training?
