Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dataset processing #6

Open
nibandan123 opened this issue Apr 16, 2020 · 3 comments
Open

Dataset processing #6

nibandan123 opened this issue Apr 16, 2020 · 3 comments

Comments

@nibandan123
Copy link

Thank you for sharing your code. But I still have two problems with data processing.
1.In your data processing code, I didn't understand how you specifically process the dataset. I also don't understand which part of the code is to process video into image sequences and audio frames. Can you tell me how to process the LRW dataset?
2. In "LRW_training_list_npy_128.txt" , What do the last numbers mean in each line? (43,69,83,109,0)

@chenruyi
Copy link

I have some question, do you have answers?

@chenruyi
Copy link

chenruyi commented Jul 30, 2020

I read the code, and may find the answers.
From the trainList_LRW.py
each line meaning:
face, audio, mouth_roi(x1,y1,x2,y2), word_label_index
so, (43,69,83,109) -> mouth roi
0 -> "ABOUT", word label

@nibandan123
Copy link
Author

Thank you. I also read the code these days. I come to the same conclusion as you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants