GitHub - nv-tlabs/L4GM-official: [NeurIPS 2024] L4GM: Large 4D Gaussian Reconstruction Model

L4GM: Large 4D Gaussian Reconstruction Model

Paper | Project Page | Model Weights

We present L4GM, the first 4D Large Reconstruction Model that produces animated objects from a single-view video input -- in a single feed-forward pass that takes only a second.

Install

conda env create -f environment.yml
conda activate l4gm

Inference

Download pretrained L4GM model and 4D interpolation model to pretrained/recon.safetensors and pretrained/interp.safetensors respectively.

Select an input video. Remove its background and crop it to 256x256 with third-party tools. We provide some processed examples in the data_test folder.

Generate 3D by:

python infer_3d.py big --workspace results --resume pretrained/recon.safetensors --num_frames 1 --test_path data_test/otter-on-surfboard_fg.mp4

Generate 4D by:

python infer_4d.py big --workspace results --resume pretrained/recon.safetensors --interpresume pretrained/interp.safetensors --num_frames 16 --test_path data_test/otter-on-surfboard_fg.mp4

Training

Render Objaverse with Blender scripts in the blender_scripts folder first.

Download pretrained LGM to pretrained/model_fixrot.safetensors.

L4GM model training:

accelerate launch \
    --config_file acc_configs/gpu8.yaml \
    main.py big \
    --workspace workspace_recon \
    --resume pretrained/model_fixrot.safetensors \
    --data_mode 4d \
    --num_epochs 200 \
    --prob_cam_jitter 0 \
    --datalist data_train/datalist_8fps.txt \

Our released checkpoint uses --num_epochs 500.

4D Interpolation model training:

accelerate launch \
    --config_file acc_configs/gpu8.yaml \
    main.py big \
    --workspace workspace_interp \
    --resume workspace_recon/model.safetensors \
    --data_mode 4d_interp \
    --num_frames 4 \
    --num_epochs 200 \
    --prob_cam_jitter 0 \
    --prob_grid_distortion 0 \
    --datalist data_train/datalist_24fps.txt \

Citation

@inproceedings{ren2024l4gm,
    title={L4GM: Large 4D Gaussian Reconstruction Model}, 
    author={Jiawei Ren and Kevin Xie and Ashkan Mirzaei and Hanxue Liang and Xiaohui Zeng and Karsten Kreis and Ziwei Liu and Antonio Torralba and Sanja Fidler and Seung Wook Kim and Huan Ling},
    booktitle={Proceedings of Neural Information Processing Systems(NeurIPS)},
    month = {Dec},
    year={2024}
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
acc_configs		acc_configs
assets		assets
blender_scripts		blender_scripts
core		core
data_test		data_test
mvdream		mvdream
.gitignore		.gitignore
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
environment.yml		environment.yml
infer_3d.py		infer_3d.py
infer_4d.py		infer_4d.py
main.py		main.py
readme.md		readme.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

L4GM: Large 4D Gaussian Reconstruction Model

Install

Inference

Training

Citation

About

Releases

Languages

License

nv-tlabs/L4GM-official

Folders and files

Latest commit

History

Repository files navigation

L4GM: Large 4D Gaussian Reconstruction Model

Install

Inference

Training

Citation

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Languages