You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for sharing your code. But I still have two problems with data processing.
1.In your data processing code, I didn't understand how you specifically process the dataset. I also don't understand which part of the code is to process video into image sequences and audio frames. Can you tell me how to process the LRW dataset?
2. In "LRW_training_list_npy_128.txt" , What do the last numbers mean in each line? (43,69,83,109,0)
The text was updated successfully, but these errors were encountered:
I read the code, and may find the answers.
From the trainList_LRW.py
each line meaning:
face, audio, mouth_roi(x1,y1,x2,y2), word_label_index
so, (43,69,83,109) -> mouth roi
0 -> "ABOUT", word label
Thank you for sharing your code. But I still have two problems with data processing.
1.In your data processing code, I didn't understand how you specifically process the dataset. I also don't understand which part of the code is to process video into image sequences and audio frames. Can you tell me how to process the LRW dataset?
2. In "LRW_training_list_npy_128.txt" , What do the last numbers mean in each line? (43,69,83,109,0)
The text was updated successfully, but these errors were encountered: