Mandi Zhao, Yijia Weng, Dominik Bauer, Shuran Song
Use a conda environment with Python 3.9, and install packages from the provided `environment.yml` file:

```shell
conda create -n real2code python=3.9
conda activate real2code
conda env update --file environment.yml --prune
```
This repo encapsulates multiple sub-modules of the Real2Code pipeline.
- Dataset: all modules use the same synthetic dataset of RGBD images, part-level meshes, and code snippets describing the joint structure of each object. We have released this dataset (link below), and provide processing & rendering utility scripts in `data_utils/` if you want to generate your own data.
- Part-level 2D Segmentation and 3D Shape Completion: with the same set of objects, we fine-tune a 2D SAM model for part-level segmentation and train a PointNet-based model for 3D shape completion. More details on each sub-module are documented in the READMEs under part segmentation and shape completion.
- LLM Fine-tuning: we fine-tune a CodeLlama model on the code representations of our articulated objects. See this fork for the LLM fine-tuning script.
- Real-World Evaluation: see `real_obj/`. We use DUSt3R to reconstruct scenes from multi-view, pose-free RGB images; the DUSt3R-generated 3D pointmaps are provided in the real-world dataset below.
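The DUSt3R pointmaps in the `id_scene.npz` files are per-pixel 3D points that can be flattened into a point cloud. A minimal sketch of this, using a synthetic stand-in file — the key name `pts3d` and the array layout are assumptions for illustration, not the released schema:

```python
import io
import numpy as np

# Build a synthetic stand-in for an id_scene.npz file. The real files hold
# DUSt3R-generated, globally-aligned pointmaps; the key name "pts3d" and
# the (H, W, 3) layout here are assumptions, not the released schema.
buf = io.BytesIO()
np.savez(buf, pts3d=np.random.rand(512, 512, 3).astype(np.float32))
buf.seek(0)

scene = np.load(buf)
pts = scene["pts3d"]         # per-pixel 3D points, shape (H, W, 3)
cloud = pts.reshape(-1, 3)   # flatten to an N x 3 point cloud
print(cloud.shape)           # -> (262144, 3) for a 512x512 image
```

In practice you would pass the real file path to `np.load` instead of the in-memory buffer, and inspect `scene.files` to see which keys the released `.npz` files actually contain.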
Our dataset is built on top of PartNet-Mobility assets, and the same set of objects is used for training and testing throughout our SAM fine-tuning, shape completion model training, and LLM fine-tuning modules. The full dataset will be released here: https://drive.google.com/drive/folders/1rkUP7NBRQX5h6ixJr9SvX0Vh3fhj1YqO?usp=drive_link
We have released the real-object data used for evaluating Real2Code. These are objects found in common lab and household settings around the Stanford campus. Raw data is captured using a LiDAR-equipped iPhone camera and the 3dScanner App.
- Download: Google Drive Link
- Structure: each object folder is organized as follows:

```shell
obj_id/
├── raw/
└── sam/   # a list of (id.jpg, id_mask.png, id_scene.npz)
```

Each `id` corresponds to one 512x512 RGB image selected from the raw dataset, e.g. `00000.jpg`; `id_mask.png` is the foreground object mask obtained by prompting the SAM model with query points randomly sampled in the image margin area; `id_scene.npz` is the globally-aligned 3D point cloud obtained from DUSt3R.
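Sampling query points from the image margin gives SAM background prompts, from which the foreground object mask follows. A minimal sketch of such a sampler — the function name, margin width, and point count are illustrative assumptions, not values from the Real2Code pipeline:

```python
import numpy as np

def sample_margin_points(h, w, margin=32, n=8, rng=None):
    """Sample n (x, y) pixel coordinates from the image margin area.

    These can serve as background prompts for SAM, so that the returned
    mask covers the foreground object. The margin width and point count
    are illustrative defaults, not the values used by Real2Code.
    """
    rng = np.random.default_rng() if rng is None else rng
    pts = []
    while len(pts) < n:
        x = int(rng.integers(0, w))
        y = int(rng.integers(0, h))
        # keep only points within `margin` pixels of an image border
        if x < margin or x >= w - margin or y < margin or y >= h - margin:
            pts.append((x, y))
    return np.array(pts)

points = sample_margin_points(512, 512)
print(points.shape)  # (8, 2)
```

A point array like this would be passed to the SAM predictor with all-zero (background) labels; the model then segments the object occupying the image center.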