This repo is the official implementation of ACFL and is built on the CTR-GCN repo.
In this work, we are interested in skeleton-based action recognition, with a focus on the learning paradigm. Most existing methods improve GCNs by leveraging multiple skeleton forms for their complementary cues. However, these methods (whether adapting the GCN structure or ensembling models) require all skeleton forms to be available during both training and inference, whereas a typical real-world situation is that only some forms are available at inference time.
To tackle this issue, we present Adaptive Cross-Form Learning (ACFL), which empowers well-designed GCNs to generate complementary representations from single-form skeletons without changing model capacity. Specifically, each GCN in ACFL not only learns an action representation from its own single-form skeletons, but also adaptively mimics useful representations derived from other skeleton forms. In this way, each GCN learns how to strengthen what it has already learned, exploiting the model's potential and facilitating action recognition.
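At its core, cross-form mimicry can be implemented as an adaptively weighted distillation between per-form classifiers. Below is a minimal PyTorch sketch of one such term; the function name and the confidence-based weighting are our assumptions for illustration, not the exact loss used in this repo.

```python
import torch
import torch.nn.functional as F

def adaptive_mimicry_loss(student_logits, teacher_logits, labels, T=2.0):
    """Illustrative cross-form mimicry term (an assumption, not the repo's exact loss).

    The student (e.g., a joint-form GCN) mimics a teacher trained on another
    skeleton form (e.g., bone), weighted per sample by how reliable the
    teacher is, so unhelpful cues are down-weighted adaptively.
    """
    with torch.no_grad():
        teacher_prob = F.softmax(teacher_logits / T, dim=1)
        # Per-sample weight: the teacher's confidence on the ground-truth class.
        weight = teacher_prob.gather(1, labels.view(-1, 1)).squeeze(1)
    kl = F.kl_div(
        F.log_softmax(student_logits / T, dim=1),
        teacher_prob,
        reduction="none",
    ).sum(dim=1)
    return (weight * kl).mean() * (T * T)
```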
We empirically demonstrate the effectiveness of our method through superior action classification results on three benchmark datasets: NTU RGB+D 120, NTU RGB+D 60 and the UAV dataset.
Top-1 classification accuracy (%) with and without ACFL:

| Settings | Input Form | NTU RGB+D 120 X-Sub | NTU RGB+D 120 X-Set | NTU RGB+D 60 X-Sub | NTU RGB+D 60 X-View | UAV X-Sub |
| --- | --- | --- | --- | --- | --- | --- |
| CTR-GCN | Joint | 84.9 | 86.5 | 89.6 | 94.5 | 41.7 |
| CTR-GCN w/ ACFL | Joint | 87.3 | 88.7 | 91.2 | 96.4 | 43.8 |
| CTR-GCN | Bone | 85.7 | 87.4 | 90.2 | 94.9 | 41.0 |
| CTR-GCN w/ ACFL | Bone | 88.4 | 89.5 | 91.4 | 96.4 | 43.3 |
| CTR-GCN | Joint & Bone | 86.9 | 88.8 | 90.9 | 95.6 | 42.4 |
| CTR-GCN w/ ACFL | Joint & Bone | 89.3 | 90.2 | 92.0 | 96.6 | 44.2 |
| Settings | Input Form | NTU RGB+D 120 X-Sub | NTU RGB+D 120 X-Set | NTU RGB+D 60 X-Sub | NTU RGB+D 60 X-View | UAV X-Sub |
| --- | --- | --- | --- | --- | --- | --- |
| Shift-GCN | Joint | 82.8 | 84.3 | 88.0 | 93.6 | 41.3 |
| Shift-GCN w/ ACFL | Joint | 85.1 | 85.9 | 90.0 | 94.1 | 43.2 |
| Shift-GCN | Bone | 83.7 | 84.2 | 89.6 | 92.4 | 40.5 |
| Shift-GCN w/ ACFL | Bone | 85.5 | 86.7 | 90.3 | 93.9 | 41.6 |
| Shift-GCN | Joint & Bone | 85.4 | 86.2 | 89.5 | 93.4 | 41.2 |
| Shift-GCN w/ ACFL | Joint & Bone | 86.9 | 87.8 | 90.5 | 94.5 | 42.6 |
| Settings | Input Form | NTU RGB+D 120 X-Sub | NTU RGB+D 120 X-Set | NTU RGB+D 60 X-Sub | NTU RGB+D 60 X-View | UAV X-Sub |
| --- | --- | --- | --- | --- | --- | --- |
| MS-G3D | Joint | 85.4 | 87.2 | 89.7 | 94.7 | 42.2 |
| MS-G3D w/ ACFL | Joint | 87.3 | 88.7 | 90.3 | 94.7 | 43.3 |
| MS-G3D | Bone | 86.7 | 87.6 | 90.5 | 94.5 | 41.7 |
| MS-G3D w/ ACFL | Bone | 88.1 | 89.0 | 91.0 | 95.5 | 43.3 |
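The Joint & Bone rows are obtained by fusing the scores of the two single-form models at test time (the usual late score fusion in skeleton-recognition codebases such as CTR-GCN). A minimal sketch, assuming per-sample class scores have already been saved by each model; `alpha` is a hypothetical fusion weight:

```python
import numpy as np

def ensemble_scores(joint_scores, bone_scores, labels, alpha=1.0):
    """Late score fusion of two single-form models (sketch).

    joint_scores, bone_scores: (N, num_classes) arrays of per-sample scores.
    labels: (N,) ground-truth class indices. Tune `alpha` on validation data.
    """
    fused = joint_scores + alpha * bone_scores
    top1 = (fused.argmax(axis=1) == labels).mean()
    return top1
```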
- Request the datasets here: http://rose1.ntu.edu.sg/Datasets/actionRecognition.asp
- Download the skeleton-only datasets:
  - `nturgbd_skeletons_s001_to_s017.zip` (NTU RGB+D 60)
  - `nturgbd_skeletons_s018_to_s032.zip` (NTU RGB+D 120)
- Extract the above files to `./data/nturgbd_raw`
Put the downloaded data into the following directory structure:

```
- data/
  - ntu/
  - ntu120/
  - UAV/
  - nturgbd_raw/
    - nturgb+d_skeletons/      # from `nturgbd_skeletons_s001_to_s017.zip`
      ...
    - nturgb+d_skeletons120/   # from `nturgbd_skeletons_s018_to_s032.zip`
      ...
```
- Generate the NTU RGB+D 60 or NTU RGB+D 120 dataset:

```
cd ./data/ntu # or cd ./data/ntu120
# Get the skeleton of each performer
python get_raw_skes_data.py
# Remove bad skeletons
python get_raw_denoised_data.py
# Transform the skeletons to the center of the first frame
python seq_transformation.py
```
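The joint form comes directly from the processed skeletons; the bone form is typically derived from it as vectors between physically connected joints. A minimal sketch, assuming NTU-style tensors of shape (N, C, T, V, M); the (child, parent) pairing below is the standard 25-joint NTU pairing and should be verified against the repo's data code:

```python
import numpy as np

# (child, parent) pairs of the 25-joint NTU skeleton, 0-indexed.
NTU_PAIRS = [(0, 1), (1, 20), (2, 20), (3, 2), (4, 20), (5, 4), (6, 5),
             (7, 6), (8, 20), (9, 8), (10, 9), (11, 10), (12, 0), (13, 12),
             (14, 13), (15, 14), (16, 0), (17, 16), (18, 17), (19, 18),
             (21, 22), (22, 7), (23, 24), (24, 11)]

def joints_to_bones(joint_data):
    """joint_data: (N, C, T, V, M) array; returns bone vectors of the same shape."""
    bone_data = np.zeros_like(joint_data)
    for child, parent in NTU_PAIRS:
        # Each bone is the offset from a joint's parent to the joint itself.
        bone_data[:, :, :, child, :] = (
            joint_data[:, :, :, child, :] - joint_data[:, :, :, parent, :]
        )
    return bone_data
```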
Pretrained models are expected in the following directory structure:

```
${ACFL_ROOT}
|-- source_models
|   |-- CTR-GCN
|   |   |-- ctrgcn_joint_ntu120_xsub.pt
|   |   |-- ctrgcn_bone_ntu120_xsub.pt
|   |   |-- ctrgcn_joint_bone_ntu120_xsub.pt
|   |   |-- ...
|   |-- Shift-GCN
|   |   |-- shiftgcn_joint_ntu120_xsub.pt
|   |   |-- shiftgcn_bone_ntu120_xsub.pt
|   |   |-- shiftgcn_joint_bone_ntu120_xsub.pt
|   |   |-- ...
|-- acfl_models
|   |-- CTR-GCN
|   |   |-- ctrgcn_joint_ntu120_xsub.pt
|   |   |-- ctrgcn_bone_ntu120_xsub.pt
|   |   |-- ctrgcn_joint_bone_ntu120_xsub.pt
|   |   |-- ...
|   |-- Shift-GCN
|   |   |-- shiftgcn_joint_ntu120_xsub.pt
|   |   |-- shiftgcn_bone_ntu120_xsub.pt
|   |   |-- shiftgcn_joint_bone_ntu120_xsub.pt
|   |   |-- ...
```
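Off-line ACFL distills from fixed source models, so the checkpoints under `source_models` only need to be loaded and frozen. A minimal sketch, assuming an already-constructed PyTorch model of the matching architecture; the helper name is ours:

```python
import torch

def load_frozen_source(model, ckpt_path, device="cuda:0"):
    """Load a pretrained single-form source model and freeze it so it only
    provides targets to mimic during off-line ACFL (sketch; assumes the
    checkpoint stores a plain state_dict)."""
    state = torch.load(ckpt_path, map_location=device)
    model.load_state_dict(state)
    model.to(device).eval()
    for p in model.parameters():
        p.requires_grad_(False)
    return model
```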
- Change the config file depending on what you want. We provide three ways to train GCN models:

```
# Example: training CTR-GCN via standard single-form representation learning (SFRL) on NTU RGB+D 120 cross-subject with GPUs 0 1 2 3
python standard_train_net.py --config config/nturgbd120-cross-subject/baselines/baseline_ctrgcn.yaml --work-dir work_dir/ntu120/csub/baseline_ctrgcn --device 0 1 2 3

# Example: training CTR-GCN via on-line ACFL on NTU RGB+D 120 cross-subject with GPUs 0 1 2 3
python online_acfl_train_net.py --config config/nturgbd120-cross-subject/ACFL/online_acfl_ctrgcn.yaml --work-dir work_dir/ntu120/csub/online_acfl_ctrgcn --device 0 1 2 3

# Example: training CTR-GCN via off-line ACFL on NTU RGB+D 120 cross-subject with GPUs 0 1 2 3
python offline_acfl_train_net.py --config config/nturgbd120-cross-subject/ACFL/offline_acfl_ctrgcn.yaml --work-dir work_dir/ntu120/csub/offline_acfl_joint_ctrgcn --device 0 1 2 3
```
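For orientation, on-line ACFL co-trains the per-form models so that each mimics the others' current predictions, while off-line ACFL mimics the frozen source models loaded above. A schematic co-training step, with our own naming and the adaptive weighting omitted for brevity:

```python
import torch
import torch.nn.functional as F

def online_acfl_step(joint_model, bone_model, joint_x, bone_x, labels,
                     optimizer, lam=1.0, T=2.0):
    """Schematic on-line co-training step (our naming, not the repo's API)."""
    joint_logits = joint_model(joint_x)
    bone_logits = bone_model(bone_x)

    def mimic(student, teacher):
        # Soft-target KL toward the other form's current predictions.
        return F.kl_div(F.log_softmax(student / T, dim=1),
                        F.softmax(teacher.detach() / T, dim=1),
                        reduction="batchmean") * (T * T)

    loss = (F.cross_entropy(joint_logits, labels)
            + F.cross_entropy(bone_logits, labels)
            + lam * mimic(joint_logits, bone_logits)
            + lam * mimic(bone_logits, joint_logits))
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```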
- To test the trained models saved in `<work_dir>`, run the following command:

```
# Example: testing CTR-GCN trained via off-line ACFL on NTU RGB+D 120 cross-subject
python standard_train_net.py --config config/nturgbd120-cross-subject/baselines/baseline_ctrgcn.yaml --work-dir work_dir/ntu120/csub/offline_acfl_joint_ctrgcn --phase test --weights work_dir/ntu120/csub/offline_acfl_joint_ctrgcn/xxx.pt --device 0
```
This repo is mainly based on CTR-GCN. Thanks to the original authors for their contribution.
```
@inproceedings{WangACFL,
  title     = {Skeleton-based Action Recognition via Adaptive Cross-Form Learning},
  author    = {Wang, Xuanhan and Dai, Yan and Gao, Lianli and Song, Jingkuan},
  booktitle = {ACM MM},
  year      = {2022}
}

@article{wangacflsuppl,
  title   = {Skeleton-based Action Recognition via Adaptive Cross-Form Learning},
  author  = {Wang, Xuanhan and Dai, Yan and Gao, Lianli and Song, Jingkuan},
  journal = {arXiv preprint arXiv:2206.15085},
  year    = {2022}
}
```