Hello, I read the paper, and it says that you used the FHO subset of the Ego4D dataset.
However, there appear to be only about 1,700 video clips in the FHO subset, while your training batch size is 512.
In addition, some video clips do not contain the audio modality.
As a result, the code repeatedly searches for videos that contain audio just to obtain a single sample,
so assembling a single batch takes a very long time.
Is there something wrong with my understanding, or do you simply rely on a high num_workers?
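For reference, one way to avoid the repeated per-sample retries described above is to filter out audio-less clips once when the dataset is constructed, so every index is guaranteed to have audio. This is only a minimal sketch, not the repo's actual code: the `clips` metadata format and the `has_audio` probe are hypothetical stand-ins (the real check might inspect the container's streams via ffprobe or torchaudio).

```python
def has_audio(clip):
    # Hypothetical probe: a real implementation would inspect the
    # video container's audio streams instead of a metadata flag.
    return clip.get("audio", False)

class AudioVideoDataset:
    """Map-style dataset that pre-filters clips missing the audio modality."""

    def __init__(self, clips):
        # Single O(N) pass at init time, instead of an unbounded
        # retry loop inside __getitem__ on every sample fetch.
        self.clips = [c for c in clips if has_audio(c)]

    def __len__(self):
        return len(self.clips)

    def __getitem__(self, idx):
        # Every clip here is known to have audio, so no retries are needed.
        return self.clips[idx]

# Toy example with fake metadata:
clips = [
    {"id": "a", "audio": True},
    {"id": "b", "audio": False},
    {"id": "c", "audio": True},
]
ds = AudioVideoDataset(clips)
print(len(ds))  # 2
```

With this structure, a DataLoader only ever samples valid indices, so batch assembly time no longer depends on how many audio-less clips happen to be drawn.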