
Data Preprocessing Human3.6M + Adaptation for different skeleton #3

StevRamos opened this issue Jan 11, 2022 · 8 comments

@StevRamos

How did you preprocess the Human3.6M dataset? I would like to replicate npy and pkl files that you provide. Do you have a code of these? Thanks in advance!

@DegardinBruno
Owner

Hi @StevRamos, thanks for your interest in Kinetic-GAN!
For consistency, we used the same data as previous methods. The authors of SA-GCN ("Structure-Aware Human-Action Generation") provided us with their data, which had also been obtained by the other methods. Their GitHub: https://github.com/PingYu-iris/SA-GCN

We just rearranged it to be easier to use!
Let me know if you have any further questions.

@StevRamos
Author

Thanks for the prompt response! I will review it.

I would like to use your model to generate new sign language videos (for data augmentation purposes). The problem is that my dataset is a set of videos. I have recently learned a little about GNNs, so as I understand it, each node has features. It would be great if you could tell me whether it is possible to obtain (replicate) these features for the nodes of each video in my dataset (sign language videos), or whether I need other tools to make that possible, and what these features represent.

You did amazing work! Thanks for making the code public!

@DegardinBruno
Owner

DegardinBruno commented Jan 11, 2022

Thank you very much! By the way, the content/shape of each dataset is N x C x T x V (x M), where N is the number of samples, C the number of coordinates, T the number of temporal instances (frames), and V the number of joints. M is usually 1 if there is a fifth dimension.
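For illustration only, here is a minimal sketch of that layout (the sizes below are hypothetical, not the ones from our datasets):

```python
import numpy as np

# Hypothetical sizes, only to illustrate the N x C x T x V (x M) layout
N, C, T, V, M = 100, 3, 64, 25, 1  # samples, coordinates (x, y, z), frames, joints, skeletons per sample

data = np.zeros((N, C, T, V, M), dtype=np.float32)

# e.g. the x-coordinate of joint 10 at frame 5 of the first sample (first skeleton)
x = data[0, 0, 5, 10, 0]
```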

Yes, absolutely, great idea! You can even build your own conditional model with Kinetic-GAN to generate specific words and letters; you just need to extract the 2D or 3D hand pose estimations first.
After that, you will need to define/change its adjacency matrix (a V x V matrix, where V is the number of joints in the hand; connected joints are 1, otherwise 0) by changing the connected joints in the data (check the graph_ntu.py file); see the sketch below.
Then, you define/change the upsampling and downsampling path (also in graph_ntu.py). There are some comments there showing how you can visualise the upsampling paths just by running that code!
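As a rough sketch of how such an adjacency matrix could be built from a list of connected joints (the 21-joint hand layout and joint indices below are hypothetical, not the ones used in graph_ntu.py):

```python
import numpy as np

# Hypothetical 21-joint hand skeleton, only for illustration
V = 21
neighbor_base = [
    (0, 1), (1, 2), (2, 3), (3, 4),         # thumb
    (0, 5), (5, 6), (6, 7), (7, 8),         # index finger
    (0, 9), (9, 10), (10, 11), (11, 12),    # middle finger
    (0, 13), (13, 14), (14, 15), (15, 16),  # ring finger
    (0, 17), (17, 18), (18, 19), (19, 20),  # little finger
]

A = np.zeros((V, V))
for i, j in neighbor_base:
    A[i, j] = 1
    A[j, i] = 1  # undirected graph: connected joints are 1, otherwise 0
```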

@StevRamos
Author

Thank you very much, @DegardinBruno. That helps me a lot! So the information I need is the coordinates of each joint (at each timestep). I will get into the code. I think it is promising!

Just to clarify, I have some questions.

  1. Should all the frames in the video have the same number of joints?
  2. What do you mean by local and global movement?
  3. What does the dimension resolution level L (in the paper) mean? (I think you refer to it in this issue as M.)

Again, thanks in advance!

@DegardinBruno
Owner

DegardinBruno commented Jan 12, 2022

  1. Should all the frames in the video have the same number of joints?

Yes, at this point, Kinetic-GAN only supports a fixed number of joints through all frames.

  2. What do you mean by local and global movement?

Check our video at 0:27. In local movement, the skeleton is normalized to a root joint; global movement, on the other hand, describes the skeleton moving freely without constraints.
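As a minimal sketch of what that local normalization could look like (assuming the N x C x T x V layout above; the helper name and root index are hypothetical, not from the repository):

```python
import numpy as np

def to_local(data, root=0):
    """Express every joint relative to the root joint, removing the global trajectory.

    data: array of shape (N, C, T, V); root: index of the root joint.
    """
    return data - data[:, :, :, root:root + 1]  # broadcast the root joint over all V joints
```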

  3. What does the dimension resolution level L (in the paper) mean? (I think you refer to it in this issue as M.)

As you can see in Figure 4 of the paper, we define our upsampling path with four levels, where level 1 is a single point from the latent space and level 4 is the complete skeleton of the respective dataset.

M represents something different! In NTU RGB+D, some data samples contain 2 skeletons; that's where M comes from. However, Kinetic-GAN does not yet support action interaction between two skeletons.

@DegardinBruno DegardinBruno changed the title Data Preprocessing - Human3.6M Data Preprocessing Human3.6M + Adaptation for different skeleton Jan 12, 2022
@DegardinBruno DegardinBruno added the question Further information is requested label Jan 13, 2022
@StevRamos
Author

Hi @DegardinBruno, I have been using your model, as I told you months ago. It worked! But now I would like to use it with a different graph structure, and this time I got an error. Basically, it is caused by the assertion (assert len(self.center) == self.lvls). That's why I want to understand the idea behind the algorithm shown below:

```python
for _ in range(self.lvls - 1):
    stay  = []
    start = 1
    while True:
        remove = []
        for i in G:
            if len(G.edges(i)) == start and i not in stay:
                lost = []
                for j, k in G.edges(i):
                    stay.append(k)
                    lost.append(k)
                recon = [(l, m) for l in lost for m in lost if l != m]
                G.add_edges_from(recon)
                remove.append(i)

        if start > 10: break  # Remove as maximum as possible

        G.remove_nodes_from(remove)

        cycle = nx.cycle_basis(G)  # Check if there is a cycle in order to downsample it
        if len(cycle) > 0:
            if len(cycle[0]) == len(G):
                last = [x for x in G if x not in stay]
                G.remove_nodes_from(last)

        start += 1

    map_i = np.array([[i, x] for i, x in enumerate(G)])  # Keep track graph indices
    self.map.append(map_i)

    mapping = {}  # Change mapping labels
    for i, x in enumerate(G):
        mapping[int(x)] = i
        if int(x) == self.center[-1]:
            self.center.append(i)
```
If you could explain the idea to me with pseudo-code, I would appreciate it very much. Thanks in advance!

Stev

@DegardinBruno
Owner

Hey @StevRamos, great!!

It would be best if you changed neighbor_base to the connections of your skeleton structure.
Uncomment the lines before the assertions to visualise your graph levels!

If you could explain the idea to me with pseudo-code, I would appreciate it very much.

We are basically removing low-degree nodes and their edges, keeping at least one parent node in the graph for the next level, because you can't just remove edges arbitrarily or the graph becomes inconsistent.
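A simplified sketch of that idea (my own paraphrase, not the exact graph_ntu.py code; the cycle-basis check is omitted and the function name is hypothetical):

```python
import networkx as nx

def downsample_level(G, center):
    """Remove low-degree nodes, keep their neighbours as parents for the next level,
    and reconnect those neighbours so the reduced graph stays consistent."""
    stay, start = set(), 1
    while start <= 10:                      # raise the degree threshold step by step
        remove = []
        for i in list(G.nodes):
            if len(G.edges(i)) == start and i not in stay:
                parents = [k for _, k in G.edges(i)]
                stay.update(parents)        # parents survive into the next level
                G.add_edges_from((l, m) for l in parents for m in parents if l != m)
                remove.append(i)
        G.remove_nodes_from(remove)
        start += 1

    new_index = {x: i for i, x in enumerate(G)}  # relabel surviving joints 0..V'-1
    new_center = new_index.get(center)           # None if the centre joint was dropped (cf. your assertion error)
    return nx.relabel_nodes(G, new_index), new_center
```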

@hendrikTpl

Hi @DegardinBruno, thanks for providing this code. By the way, I am working on human interaction generation. As you said, interaction is not yet supported; could you please guide me and provide some notes on how to make it possible? I have recently been working on HIR (recognition only), and now I want to use your code and model to generate skeleton data (for data augmentation) on a small dataset. Your help would be much appreciated. Thanks!
