Can you share your HowTo100M.csv file? #30

ShinJQ · 2022-04-20T08:53:54Z

Hi!
From your paper and readme.md file on (https://github.com/microsoft/UniVL)/dataloaders/, I could infer that the csv file you've used differ from the original csv file.

It is mentioned that 1.2M videos are used for pretraining.
Can you share the csv file that contains the id of1.2M video you've used for pretraining?

ArrowLuo · 2022-04-20T12:04:50Z

Hi @ShinJQ, I am afraid that I can not share the file now. The official CSV contains about 1.2M video ids so you can generate the HowTo100M.csv easily. Best.

HuBot2020 · 2022-05-23T20:53:34Z

Hi @ArrowLuo , sorry to bother, but I'm noticing that the pre-extracted feature files for HowTo100M supplied by the dataset owner have a file extension of '.mp4.npy' while in the README for uniVL you have the feature file with file extension '.npy'. Is this of any concern? Did you and your team do any extra processing for the feature files to get the '.npy' file extension for feature files?

ArrowLuo · 2022-05-24T01:02:54Z

Hi @HuBot2020, It is ok for the postfix of the feature filename, e.g., '.mp4.npy' or '.npy'. We have no other processing on the extracted feature. But I do not know what the difference is between the '.mp4.npy shared by the dataset owner and ours.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can you share your HowTo100M.csv file? #30

Can you share your HowTo100M.csv file? #30

ShinJQ commented Apr 20, 2022

ArrowLuo commented Apr 20, 2022

HuBot2020 commented May 23, 2022

ArrowLuo commented May 24, 2022

Can you share your HowTo100M.csv file? #30

Can you share your HowTo100M.csv file? #30

Comments

ShinJQ commented Apr 20, 2022

ArrowLuo commented Apr 20, 2022

HuBot2020 commented May 23, 2022

ArrowLuo commented May 24, 2022