FineMotion's experiments for the GENEA 2023 challenge.
Additional data can also be found in Google Drive, such as checkpoints and some renders in the media subfolder.
The Jupyter notebook PAE.ipynb contains more visualizations of the experiments and is the suggested place to start.
- `PAE.ipynb` - jupyter notebook with step-by-step experiments
- `process_motion.py` - script for processing motion data: extracts features from a bvh file and vice versa
- `prepare_motion.py` - script for additional motion data preprocessing: converts positions to joint velocities, reshapes ortho6d data to change the column order, and vice versa
- `process_audio.py` - extracts audio features
- `make_dataset.py` - combines different features into .npz archives
- `train_*.py` - scripts training one of the models
- `src` - primary package storing the source code:
  - `blender` - blender scripts for additional data visualisation
  - `mann`, `pae`, `sae` - different models, each with the following structure:
    - `dataset` - basic PyTorch Dataset
    - `model` - PyTorch model
    - `system` - pytorch-lightning module
  - `training` - common training utils: optimizer, scheduler, loss
  - `utils` - utils for data preprocessing and visualization
Data preparation, training, and inference for the VQ-VAE experiments:
python process_vqvae.py --src ./data/trn --dst ./vqvae_data/trn
python process_vqvae.py --src ./data/trn --dst ./vqvae_data/trn --speaker_motion
python process_vqvae.py --src ./data/val --dst ./vqvae_data/val
python process_vqvae.py --src ./data/val --dst ./vqvae_data/val --speaker_motion
python train_vqvae.py --trn_folder ./vqvae_data/trn/ --val_folder ./vqvae_data/val/ --serialize_dir ./result/vqvae \
--force --batch_size 1024
python vqvae_inference.py --train_data --checkpoint ./result/vqvae/last.ckpt --src ./data/trn/main-agent --dst ./vqvae_inf_data/trn
python vqvae_inference.py --checkpoint ./result/vqvae/last.ckpt --src ./data/val/main-agent --dst ./vqvae_inf_data/val
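For context, the heart of a VQ-VAE is its vector-quantization bottleneck. The sketch below is a generic, minimal version of that step (names and sizes are assumptions, not the code in train_vqvae.py):

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class VectorQuantizer(nn.Module):
    """Minimal VQ bottleneck: nearest-codebook lookup with a straight-through gradient."""
    def __init__(self, num_codes=512, dim=64, beta=0.25):
        super().__init__()
        self.codebook = nn.Embedding(num_codes, dim)
        self.beta = beta  # commitment loss weight

    def forward(self, z):  # z: (batch, dim) encoder output
        # squared distances from each latent to every codebook vector
        d = (z.pow(2).sum(1, keepdim=True)
             - 2 * z @ self.codebook.weight.t()
             + self.codebook.weight.pow(2).sum(1))
        idx = d.argmin(dim=1)                  # nearest code per sample
        q = self.codebook(idx)                 # quantized vectors
        # codebook + commitment losses (van den Oord et al., 2017)
        loss = F.mse_loss(q, z.detach()) + self.beta * F.mse_loss(z, q.detach())
        q = z + (q - z).detach()               # straight-through estimator
        return q, idx, loss
```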
Below are the data preparation steps for fitting the original PAE from AI4Animation. Assume our dataset is in the ./data folder.
- First, fit and save an sklearn.pipeline for motion data (only one bvh file is needed) via:
python process_motion.py --mode pipeline --pipeline position --src ./data/trn/bvh/trn_2023_v0_000_main-agent.bvh --pipeline_dir ./pipe_pos
- Extract raw joint positions for the trn and val datasets using the trained pipeline:
python process_motion.py --mode bvh2npy --pipeline position --pipeline_dir ./pipe_pos --src ./data/trn/bvh --dst ./data/trn/positions
python process_motion.py --mode bvh2npy --pipeline position --pipeline_dir ./pipe_pos --src ./data/val/bvh --dst ./data/val/positions
- Calculate velocities with:
python prepare_motion.py --mode velocities --src ./data/trn/positions --dst ./data/trn/velocities
python prepare_motion.py --mode velocities --src ./data/val/positions --dst ./data/val/velocities
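Here --mode velocities presumably amounts to first-order frame differencing; a minimal sketch of that operation (a hypothetical helper, not the script's exact code):

```python
import numpy as np

def positions_to_velocities(positions: np.ndarray) -> np.ndarray:
    """Approximate joint velocities as first-order frame differences.

    positions: (frames, features) array of joint positions.
    Returns an array of the same shape; the first frame is zero-padded
    so the output stays aligned with the input.
    """
    vel = np.diff(positions, axis=0)          # v[t] = p[t+1] - p[t]
    return np.concatenate([np.zeros_like(vel[:1]), vel], axis=0)
```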
We also tried training the PAE on ortho6d joint rotations. To prepare the data, use a similar pipeline:
- Fit the data pipeline:
python process_motion.py --mode pipeline --pipeline ortho6d --src ./data/trn/bvh/trn_2023_v0_000_main-agent.bvh --pipeline_dir ./pipe_ortho6d
- Extract features (joint ortho6d rotations with root position):
python process_motion.py --mode bvh2npy --pipeline ortho6d --pipeline_dir ./pipe_ortho6d --src ./data/trn/bvh --dst ./data/trn/ortho6d
python process_motion.py --mode bvh2npy --pipeline ortho6d --pipeline_dir ./pipe_ortho6d --src ./data/val/bvh --dst ./data/val/ortho6d
- Convert the data (recombine the angle columns so they are ordered by joint, and calculate the root velocity):
python prepare_motion.py --mode ortho6d --src ./data/trn/ortho6d --dst ./data/trn/ortho6d_fixed
python prepare_motion.py --mode ortho6d --src ./data/val/ortho6d --dst ./data/val/ortho6d_fixed
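The ortho6d representation is the continuous 6D rotation parameterization of Zhou et al. (2019): the first two columns of a rotation matrix, re-orthogonalized on decode. A minimal sketch of the mapping back to a matrix (generic, not the repository's exact code):

```python
import numpy as np

def ortho6d_to_matrix(r6: np.ndarray) -> np.ndarray:
    """Recover a rotation matrix from a 6D rotation representation.

    r6: (6,) vector holding the first two columns of a rotation matrix.
    """
    a1, a2 = r6[:3], r6[3:]
    b1 = a1 / np.linalg.norm(a1)              # normalize the first axis
    a2 = a2 - np.dot(b1, a2) * b1             # Gram-Schmidt: remove the b1 component
    b2 = a2 / np.linalg.norm(a2)
    b3 = np.cross(b1, b2)                     # third axis completes the frame
    return np.stack([b1, b2, b3], axis=1)     # columns are the basis vectors
```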
To train the PhaseAutoEncoder on joint velocities, use the following script:
python train_pae.py --serialize_dir ./results/pae --trn_folder ./data/trn/velocities --val_folder ./data/val/velocities \
--accelerator gpu --batch_size 512
Some parameters can be changed, such as the accelerator or batch size (note: FFT doesn't work on mps).
Model options can be changed as well: the full list can be found in src/pae/system:PAESystem.add_system_args.
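As in DeepPhase, the PAE extracts periodic parameters (frequency, amplitude, bias) from each latent curve with an FFT, which is the operation missing on mps. A rough sketch of that step (shapes and names are assumptions, not the exact implementation):

```python
import torch

def fft_periodic_params(latent: torch.Tensor, fps: float = 30.0):
    """Estimate per-channel frequency, amplitude, and bias of latent curves.

    latent: (batch, channels, frames) output of the PAE encoder.
    """
    n = latent.shape[-1]
    spectrum = torch.fft.rfft(latent, dim=-1)          # complex half-spectrum
    power = spectrum.abs() ** 2
    freqs = torch.fft.rfftfreq(n, d=1.0 / fps).to(latent.device)
    power, freqs = power[..., 1:], freqs[1:]           # drop the DC term
    freq = (power * freqs).sum(-1) / power.sum(-1)     # power-weighted frequency
    amp = 2.0 * power.sum(-1).sqrt() / n               # rough amplitude estimate
    bias = latent.mean(-1)                             # DC offset
    return freq, amp, bias
```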
The resulting Periodic AutoEncoder checkpoint is pae_joint_vel_best.ckpt.
Note: to use this checkpoint with the proposed code, copy src/training/sgdr.py to src/mann/scheduler.py, because the checkpoint was obtained before refactoring.
To train the PAE with the ortho6d representation, change some parameters:
python train_pae.py --serialize_dir ./results/pae_ortho6d --trn_folder data/trn/ortho6d_fixed \
--val_folder data/val/ortho6d_fixed --batch_size 512 --accelerator gpu --joints 25 --channels 6 --phases 5 --add_root
Here we have 25 joints (instead of 26) because the root is represented by its velocity, while the other joints are represented by 6D rotations.
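A quick sanity check of the resulting feature width under these flags (assuming the root velocity is 3-D):

```python
joints, channels = 25, 6     # --joints 25 --channels 6
root_velocity = 3            # --add_root: root modeled by a 3-D velocity (assumption)
features_per_frame = joints * channels + root_velocity
assert features_per_frame == 153
```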
Some examples of using these models can be found in PAE.ipynb.
As the main model we use a modification of the Mixture-of-Experts (ModeAdaptiveNeuralNetwork) framework, as in the original DeepPhase paper; a sketch of the expert-blending idea is shown after the list below. To train this model we need to:
- Extract Phases via PAE
- Prepare motion features
- Prepare audio features
- Combine features together
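For orientation, here is a minimal, hypothetical sketch of the mode-adaptive expert blending mentioned above (layer sizes and names are assumptions, not the repository's exact architecture):

```python
import torch
import torch.nn as nn

class GatedExperts(nn.Module):
    """A gating net reads the phase features and produces blend weights
    that mix the weight matrices of K expert layers."""
    def __init__(self, in_dim, out_dim, phase_dim, num_experts=8):
        super().__init__()
        self.gate = nn.Sequential(
            nn.Linear(phase_dim, 64), nn.ELU(),
            nn.Linear(64, num_experts), nn.Softmax(dim=-1))
        # one weight matrix and bias per expert
        self.w = nn.Parameter(torch.randn(num_experts, out_dim, in_dim) * 0.01)
        self.b = nn.Parameter(torch.zeros(num_experts, out_dim))

    def forward(self, x, phase):
        alpha = self.gate(phase)                            # (batch, K) blend weights
        w = torch.einsum('bk,koi->boi', alpha, self.w)      # blended weight matrix
        b = torch.einsum('bk,ko->bo', alpha, self.b)        # blended bias
        return torch.einsum('boi,bi->bo', w, x) + b
```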
To extract phases via the PAE, use the following script:
python extract_phases.py --src ./data/trn/velocities --dst ./data/trn/phases --checkpoint ./results/pae/last.ckpt
python extract_phases.py --src ./data/val/velocities --dst ./data/val/phases --checkpoint ./results/pae/last.ckpt
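Conceptually, extract_phases.py runs the trained PAE over the velocity features and stores the phase manifold; a hypothetical sketch (the forward signature and file names below are assumptions, not the script's exact code):

```python
import numpy as np
import torch

from src.pae.system import PAESystem  # module path as referenced above

# run the trained PAE over one clip's velocity features (hypothetical files)
system = PAESystem.load_from_checkpoint('./results/pae/last.ckpt')
system.eval()
velocities = torch.from_numpy(np.load('./data/trn/velocities/clip.npy')).float()
with torch.no_grad():
    phases = system(velocities.unsqueeze(0))  # (1, frames, phase_dim), assumed
np.save('./data/trn/phases/clip.npy', phases.squeeze(0).numpy())
```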
To extract MFCC features at the specified fps, use:
python process_audio.py --src .\data\trn\main-agent\wav\ --dst data\trn\mfcc
python process_audio.py --src .\data\val\main-agent\wav\ --dst data\val\mfcc
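Aligning MFCC frames to the motion frame rate is typically done by deriving the hop length from the fps; a minimal sketch using librosa (parameters here are assumptions, not process_audio.py defaults):

```python
import librosa

def extract_mfcc(wav_path: str, fps: float = 30.0, n_mfcc: int = 20):
    """Compute MFCCs with one feature column per motion frame."""
    y, sr = librosa.load(wav_path, sr=None)
    hop = int(round(sr / fps))                 # one hop per motion frame
    mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc, hop_length=hop)
    return mfcc.T                              # (frames, n_mfcc)
```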
First, we try to represent the motion data with joint rotations only (without root velocity), extracted in the same way as when training the PAE on rotations:
python prepare_motion.py --mode ortho6d --ignore_root --src ./data/trn/ortho6d --dst ./data/trn/ortho6d_fixed
python prepare_motion.py --mode ortho6d --ignore_root --src ./data/val/ortho6d --dst ./data/val/ortho6d_fixed
Combine the data:
python make_dataset.py --motion .\data\trn\ortho6d_fixed\ --audio .\data\trn\mfcc --phase .\data\trn\phases --dst .\data\trn\dataset
python make_dataset.py --motion .\data\val\ortho6d_fixed\ --audio .\data\val\mfcc --phase .\data\val\phases --dst .\data\val\dataset
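In essence, make_dataset.py pairs the per-clip feature arrays and stores them in one .npz archive; a hypothetical sketch (file names and keys are assumptions, arrays assumed frame-major):

```python
import numpy as np

motion = np.load('data/trn/ortho6d_fixed/clip.npy')
audio = np.load('data/trn/mfcc/clip.npy')
phase = np.load('data/trn/phases/clip.npy')
frames = min(len(motion), len(audio), len(phase))  # lengths may differ by a frame
np.savez('data/trn/dataset/clip.npz',
         motion=motion[:frames], audio=audio[:frames], phase=phase[:frames])
```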
To train the main model, use the following script:
python train_moe.py --serialize_dir .\result --trn_folder .\data\trn\dataset --val_folder .\data\val\dataset --num_workers 4 --accelerator gpu --batch_size 512
To generate the resulting motion, use the following steps:
- Run the inference script:
python .\moe_inference.py --src .\data\dataset_gloved\val\val_2023_v0_002_main-agent.npz --dst .\result\pred\ --checkpoint .\result\last.ckpt --alpha 0.5
- Reorganize the data:
python .\prepare_motion.py --src .\result\pred --dst .\result\fixed --mode ortho6d_inverse
- Generate BVH from numpy:
python .\process_motion.py --mode npy2bvh --pipeline_dir .\pipe_ortho6d --src .\result\fixed --dst .\result\bvh
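The npy2bvh mode works because the saved sklearn pipeline is invertible; conceptually (file names here are hypothetical):

```python
import joblib
import numpy as np

# the fitted sklearn pipeline maps motion channels to flat features,
# so generating BVH data is its inverse_transform
pipeline = joblib.load('pipe_ortho6d/pipeline.pkl')   # hypothetical file name
features = np.load('result/fixed/sample.npy')         # hypothetical file name
motion = pipeline.inverse_transform(features)         # features -> motion channels
```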
There is also some additional data processing, such as GloVe embeddings and data normalization. For example, suppose we want to normalize the phases and add positions, position velocities, and GloVe embeddings:
- Extract text embeddings:
python process_text.py --src data\trn\main-agent\tsv --dst data\trn\gloved --embeddings data\glove\glove.6B.50d.txt --embedding_dim 50
python process_text.py --src data\val\main-agent\tsv --dst data\val\gloved --embeddings data\glove\glove.6B.50d.txt --embedding_dim 50
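GloVe embeddings are stored as a plain text file of word vectors; a minimal loader sketch (the out-of-vocabulary handling here is an assumption):

```python
import numpy as np

def load_glove(path: str) -> dict:
    """Parse a GloVe .txt file into a word -> vector dictionary."""
    vectors = {}
    with open(path, encoding='utf-8') as f:
        for line in f:
            word, *values = line.rstrip().split(' ')
            vectors[word] = np.asarray(values, dtype=np.float32)
    return vectors

glove = load_glove('data/glove/glove.6B.50d.txt')
vec = glove.get('hello', np.zeros(50, dtype=np.float32))  # OOV words -> zeros (assumption)
```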
- Convert the ortho6d data to the appropriate form without the root:
python prepare_motion.py --mode ortho6d --src ./data/trn/ortho6d --dst ./data/trn/ortho6d_noroot --ignore_root
python prepare_motion.py --mode ortho6d --src ./data/val/ortho6d --dst ./data/val/ortho6d_noroot --ignore_root
- Get phase velocities:
python prepare_motion.py --mode velocities --src ./data/trn/phases --dst ./data/trn/phase_velocities
python prepare_motion.py --mode velocities --src ./data/val/phases --dst ./data/val/phase_velocities
- Normalize the data:
python normalize.py --src data\trn\positions --dst data\trn\positions_normalized --norm data\positions_norm.npz
python normalize.py --src data\val\positions --dst data\val\positions_normalized --norm data\positions_norm.npz
python normalize.py --src data\trn\velocities --dst data\trn\velocities_normalized --norm data\velocities_norm.npz
python normalize.py --src data\val\velocities --dst data\val\velocities_normalized --norm data\velocities_norm.npz
python normalize.py --src data\trn\ortho6d_noroot --dst data\trn\ortho6d_normalized --norm data\ortho6d_norm.npz
python normalize.py --src data\val\ortho6d_noroot --dst data\val\ortho6d_normalized --norm data\ortho6d_norm.npz
python normalize.py --src data\trn\phases --dst data\trn\phases_normalized --norm data\phases_norm.npz
python normalize.py --src data\val\phases --dst data\val\phases_normalized --norm data\phases_norm.npz
python normalize.py --src data\trn\phase_velocities --dst data\trn\phase_velocities_normalized --norm data\phase_vel_norm.npz
python normalize.py --src data\val\phase_velocities --dst data\val\phase_velocities_normalized --norm data\phase_vel_norm.npz
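Presumably, normalize.py applies z-score normalization and stores the statistics in the given .npz so that --mode backward can invert it later; a hypothetical sketch of that idea:

```python
import numpy as np

def normalize_forward(x: np.ndarray, norm_path: str) -> np.ndarray:
    """Z-score normalize and persist the statistics for later inversion."""
    mean, std = x.mean(axis=0), x.std(axis=0) + 1e-8
    np.savez(norm_path, mean=mean, std=std)
    return (x - mean) / std

def normalize_backward(x: np.ndarray, norm_path: str) -> np.ndarray:
    """Invert the normalization with the stored statistics (--mode backward)."""
    stats = np.load(norm_path)
    return x * stats['std'] + stats['mean']
```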
- Make the datasets:
python make_dataset.py --motion data\trn\ortho6d_normalized data\trn\positions_normalized data\trn\velocities_normalized \
    --audio data\trn\mfcc data\trn\gloved \
    --phase data\trn\phases_normalized data\trn\phase_velocities_normalized \
    --dst data\trn\dataset_new
python make_dataset.py --motion data\val\ortho6d_normalized data\val\positions_normalized data\val\velocities_normalized \
    --audio data\val\mfcc data\val\gloved \
    --phase data\val\phases_normalized data\val\phase_velocities_normalized \
    --dst data\val\dataset_new
- Train the model (the --vel_included flag means phase velocities are included in the dataset):
python .\train_moe.py --serialize_dir .\result_new --trn_folder data\trn\dataset_new --val_folder data\val\dataset_new --num_workers 4 --accelerator gpu --vel_included --batch_size 512 --force
- Run inference and convert the prediction back to BVH:
python .\moe_inference.py --src .\data\val\dataset_new\val_2023_v0_000_main-agent.npz --dst .\result_new\pred\ --checkpoint .\result_new\last.ckpt --alpha 0.5 --phase_norm data\phases_norm.npz --vel_norm data\phase_vel_norm.npz --vel_included
python .\normalize.py --src .\result_new\pred\ --dst .\result_new\renorm\ --norm .\data\ortho6d_norm.npz --mode backward
python .\prepare_motion.py --src .\result_new\renorm\ --dst .\result_new\fixed\ --mode ortho6d_inverse --ignore_root
python .\process_motion.py --mode npy2bvh --pipeline_dir .\data\ortho6d\ --src .\result_new\fixed\ --dst .\result_new\bvh\
- Make the test dataset:
python .\process_audio.py --src .\data\tst\main-agent\wav\ --dst data\tst\mfcc
python .\process_text.py --src data\tst\main-agent\tsv --dst data\tst\gloved --embeddings data\glove\glove.6B.50d.txt --embedding_dim 50
python make_dataset.py --audio data\tst\mfcc data\tst\gloved --dst data\tst\dataset_new
- Inference (also provide a train sample to initialize the shapes):
python .\moe_inference.py --src .\data\tst\dataset_new\tst_2023_v0_002_main-agent.npz --dst .\result_new\pred\ --checkpoint .\result_new\last.ckpt --alpha 0.5 --phase_norm data\phases_norm.npz --vel_norm data\phase_vel_norm.npz --trn_sample .\data\trn\dataset_new\trn_2023_v0_000_main-agent.npz --vel_included
Other steps are the same as for validation.