Skip to content

Latest commit

 

History

History
103 lines (80 loc) · 4.83 KB

README.md

File metadata and controls

103 lines (80 loc) · 4.83 KB

Multi-Transmotion:
Pre-trained Model for Human Motion Prediction

Yang Gao, Po-Chien Luan, Alexandre Alahi

Annual Conference on Robot Learning (CoRL), München, November 2024

[Paper] [Website] [Poster] [Slides]

Abstract

The ability of intelligent systems to predict human behaviors is crucial, particularly in fields such as autonomous vehicle navigation and social robotics. However, the complexity of human motion have prevented the development of a standardized dataset for human motion prediction, thereby hindering the establishment of pre-trained models. In this paper, we address these limitations by integrating multiple datasets, encompassing both trajectory and 3D pose keypoints, to propose a pre-trained model for human motion prediction. We merge seven distinct datasets across varying modalities and standardize their formats. To facilitate multimodal pre-training, we introduce Multi-Transmotion, an innovative transformer-based model designed for cross-modality pre-training. Additionally, we present a novel masking strategy to capture rich representations. Our methodology demonstrates competitive performance across various datasets on several downstream tasks, including trajectory prediction in the NBA and JTA datasets, as well as pose prediction in the AMASS and 3DPW datasets.

Getting Started

Install the requirements using pip:

pip install -r requirements.txt

Unified Human Motion Data Framework

The framework is using modified trajnet++ to generate specific tabular format in '.ndjson' files. Specifically, the first part of '.ndjson' file represents trainable scenes/samples and the second part denotes the detailed human motion information. There are 207 columns of comprehensive human data are listed below:

  • [0]: Frame ID

  • [1]: Agents ID

  • [2:4]: Trajectory (x, y)

  • [4:8]: 3D Bounding Box (h, w, l, rot_z)

  • [8:12]: 2D Bounding BOx (bb_left, bb_top, bb_width, bb_height)

  • [12:129]: 3D pose keypoitns (x0,x2, ..x38, y0, y2, ..y38, z0, z2, ..z38)

  • [129:207]: 3D pose keypoitns (xx0,xx2, ..xx38, yy0, yy2, ..yy38)

    The missing data will be denoted as 'null'.

Here we will show an example to convert the NBA dataset into the Unified Human Motion Dataframework. We have conveniently added the data of NBA to the release section of the repository (for license details, please refer to the original papers).

  1. Download the data from our release and place the raw data subdirectory of NBA under extract_data/NBA/raw/
  2. Extract NBA data
    python extract_data/NBA/extract_NBA.py --split train
    
  3. Convert extracted data into UniHuMotion framework.
    mv extract_data/NBA/output_csv/nba_train.csv UniHuMotion_trajnetpp/data/UniHuMotion_NBA/
    cd UniHuMotion_trajnetpp
    python -m trajnetdataset.convert --acceptance 1.0 1.0 1.0 1.0 --train_fraction 1.0 --val_fraction 0.0 --fps 5 --obs_len 10 --pred_len 20 --chunk_stride 1
    
  4. Generate trainable data cache (Efficient dataloading).
    mv output_pre UHM_NBA
    mv UHM_NBA ../data/UniHuMotion/
    cd ..
    python UniHuMotion_cache/cache_generator.py --UniHuMotion_dataset UHM_NBA
    

Pre-training and fine-tuning

The code is using pytorch DDP so that it can be easily deployed on multiple gpus.

Demo command for pre-training:

python pretrain/train.py --cfg pretrain/configs/UniHuMotion.yaml --exp_name test_git

Demo command for fine-tuning on NBA dataset:

python ft_NBA/train.py --cfg ft_NBA/configs/UniHuMotion.yaml --exp_name default

Demo command for evaluation on NBA dataset:

python ft_NBA/evaluate.py --ckpt ./experiments/default_experiment/checkpoints/checkpoint_default/FT_NBA_ckpt.pth.tar --metric ade_fde

For the ease of use, we have also provided the pre-trained fine-tuned models in the release section of this repo.

Work in Progress

This repository is work-in-progress and will continue to get updated and improved over the coming months.

@inproceedings{gao2024multi,
    title={Multi-Transmotion: Pre-trained Model for Human Motion Prediction},
    author={Gao, Yang and Luan, Po-Chien and Alahi, Alexandre},
    booktitle={8th Annual Conference on Robot Learning},
    year={2024}
}

This work is a follow-up work of Social-Transmotion:

@InProceedings{saadatnejad2024socialtransmotion,
    title={Social-Transmotion: Promptable Human Trajectory Prediction}, 
    author={Saadatnejad, Saeed and Gao, Yang and Messaoud, Kaouther and Alahi, Alexandre},
    booktitle={International Conference on Learning Representations (ICLR)},
    year={2024}
}