InstSynth: Instance-wise Prompt-guided Style Masked Conditional Data Synthesis for Scene Understanding
This repository is the official implementation of the paper entitled: InstSynth: Instance-wise Prompt-guided Style Masked Conditional Data Synthesis for Scene Understanding.
Authors: Thanh-Danh Nguyen, Bich-Nga Pham, Trong-Tai Dam Vu, Vinh-Tiep Nguyen†, Thanh Duc Ngo, and Tam V. Nguyen.
[2024/7] We have released the visualization, and initial instructions for InstSynth⚡!
Download and install Anaconda with the recommended version from Anaconda Homepage: Anaconda3-2019.03-Linux-x86_64.sh
git clone https://github.com/danhntd/InstSynth.git
cd InstSynth
curl -O https://repo.anaconda.com/archive/Anaconda3-2019.03-Linux-x86_64.sh
bash Anaconda3-2019.03-Linux-x86_64.sh
After completing the installation, please create and initiate the workspace with the specific versions below. The experiments were conducted on a Linux server with a single GeForce RTX 2080Ti GPU
, CUDA 11.1, Torch 1.9.
conda create --name InstSynth python=3
conda activate InstSynth
conda install pytorch==1.9.0 torchvision cudatoolkit=11.1 -c pytorch -c nvidia -y
This source code is based on Detectron2. Please refer to INSTALL.md for the pre-built or building Detectron2 from source.
After setting up the dependencies, use the command python setup.py build develop
in this root to finish.
In case you face some environmental conflicts, these installations may help:
pip install mxnet-mkl==1.6.0 numpy==1.23.1
In this work, we utilize Cityscapes Dataset for training and testing our proposed method.
Please visit this link for the dataset description and downloading.
Our proposed InstSynth framework:
Please use the following bibtex to cite this repository:
@inproceedings{nguyen2024instsynth,
title={InstSynth: Instance-wise Prompt-guided Style Masked Conditional Data Synthesis for Scene Understanding},
author={Nguyen, Thanh-Danh and Pham, Bich-Nga and Dam Vu, Trong-Tai and Nguyen, Vinh-Tiep and Ngo, Thanh Duc and Tam, Nguyen V.},
booktitle={2023 International Conference on Multimedia Analysis and Pattern Recognition (MAPR)},
pages={1--6},
year={2024},
organization={IEEE}
}