MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training

De-An Huang, Zhiding Yu, Anima Anandkumar

[arXiv] [Project] [BibTeX]

Features

Performs video instance segmentation while training only an image instance segmentation model.
Supports the major video instance segmentation datasets: YouTubeVIS 2019/2021 and Occluded VIS (OVIS).
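
As a rough illustration of why no video-based training is needed: the MinVIS paper describes treating frames as independent images at inference and tracking instances by bipartite matching of the per-frame query embeddings. The sketch below is not the repository's code. The function names (`cosine`, `match_queries`, `track`) are hypothetical, the embeddings are toy 2-D vectors, and a brute-force search over permutations stands in for the Hungarian-style solver a real implementation would likely use.

```python
from itertools import permutations
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two non-zero vectors given as lists of floats."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sqrt(sum(x * x for x in a))
    norm_b = sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def match_queries(prev, curr):
    """Return `order` such that curr[order[i]] is matched to prev[i].

    Brute-force optimal bipartite matching: tries every permutation and keeps
    the one with the highest total cosine similarity. Only practical for a
    handful of queries; an assumption made to keep the sketch stdlib-only.
    """
    n = len(prev)
    best = max(
        permutations(range(n)),
        key=lambda p: sum(cosine(prev[i], curr[p[i]]) for i in range(n)),
    )
    return list(best)


def track(frames):
    """Reorder each frame's queries so index i follows the same instance over time.

    `frames` is a list of frames; each frame is a list of query embeddings
    produced independently by an image-level model (hypothetical input format).
    """
    tracked = [frames[0]]
    for curr in frames[1:]:
        order = match_queries(tracked[-1], curr)
        tracked.append([curr[j] for j in order])
    return tracked


# Toy example: the second frame holds the same three instances in shuffled order.
frame0 = [[1.0, 0.0], [0.0, 1.0], [-1.0, 0.0]]
frame1 = [[0.1, 0.9], [-0.9, 0.1], [0.9, 0.1]]
tracked = track([frame0, frame1])
```

After `track` runs, position `i` in every frame refers to the same instance, so the per-frame masks can be stacked into a video-level prediction without the model ever having seen video during training.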

Qualitative Results on Occluded VIS

Installation

See the installation instructions in INSTALL.md.

Getting Started

See Preparing Datasets for MinVIS.

See Getting Started with MinVIS (GETTING_STARTED.md).

Model Zoo

Trained models are available for download in the MinVIS Model Zoo (MODEL_ZOO.md).

License

The majority of MinVIS is made available under the NVIDIA Source Code License-NC. The trained models in the MinVIS Model Zoo are made available under the CC BY-NC-SA 4.0 License.

Portions of the project are available under separate license terms: Mask2Former and Swin-Transformer-Semantic-Segmentation are licensed under the MIT License, and Deformable-DETR is licensed under the Apache-2.0 License.

Citing MinVIS

@inproceedings{huang2022minvis,
  title={MinVIS: A Minimal Video Instance Segmentation Framework without Video-based Training},
  author={De-An Huang and Zhiding Yu and Anima Anandkumar},
  booktitle={NeurIPS},
  year={2022}
}

Acknowledgement

This repo is largely based on Mask2Former (https://github.com/facebookresearch/Mask2Former).
