Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation

[arXiv] [BibTeX]

Updates

Our new project, Segment and Track Anything (SAM-Track), has been released. It focuses on segmenting and tracking any objects in videos, using both automatic and interactive methods.
An official PaddlePaddle implementation will be released in this repo.

Abstract

The objective of this paper is self-supervised learning of video object segmentation (VOS). We develop a unified framework which simultaneously models cross-frame dense correspondence for locally discriminative feature learning and embeds object-level context for target-mask decoding. As a result, it is able to directly learn to perform mask-guided sequential segmentation from unlabeled videos, in contrast to previous efforts, which usually rely on an oblique solution: cheaply "copying" labels according to pixel-wise correlations. Concretely, our algorithm alternates between i) clustering video pixels to create pseudo segmentation labels ex nihilo; and ii) utilizing the pseudo labels to learn mask encoding and decoding for VOS. Unsupervised correspondence learning is further incorporated into this self-taught mask embedding scheme, so as to ensure the generic nature of the learnt representation and avoid cluster degeneracy. Our algorithm sets a new state of the art on two standard benchmarks (i.e., DAVIS17 and YouTube-VOS), narrowing the gap between self- and fully-supervised VOS in terms of both performance and network architecture design.
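
Step i) of the alternation above, clustering pixels into pseudo segmentation labels, can be illustrated with a toy example. This is a minimal sketch, not the implementation in this repo: it assumes per-pixel features are already available as an `(H, W, C)` array, uses plain k-means as a stand-in clustering method, and the function name `kmeans_pseudo_labels` is hypothetical.

```python
import numpy as np


def kmeans_pseudo_labels(features, num_clusters, num_iters=10, seed=0):
    """Cluster per-pixel features with plain k-means (illustrative only).

    features: array of shape (H, W, C), one feature vector per pixel.
    Returns an int array of shape (H, W) with values in [0, num_clusters).
    """
    h, w, c = features.shape
    flat = features.reshape(-1, c).astype(np.float64)
    rng = np.random.default_rng(seed)

    # Initialise centroids from randomly chosen pixels (no repeats).
    init_idx = rng.choice(flat.shape[0], size=num_clusters, replace=False)
    centroids = flat[init_idx].copy()

    def assign(points, centers):
        # Squared Euclidean distance from every pixel to every centroid.
        dists = ((points[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
        return dists.argmin(axis=1)

    for _ in range(num_iters):
        labels = assign(flat, centroids)
        for k in range(num_clusters):
            members = flat[labels == k]
            if len(members) > 0:  # keep the old centroid if a cluster is empty
                centroids[k] = members.mean(axis=0)

    # Final assignment against the updated centroids.
    return assign(flat, centroids).reshape(h, w)


# Toy frame: the left half and the right half have clearly different features.
toy_features = np.zeros((4, 4, 3))
toy_features[:, 2:, :] = 10.0
toy_labels = kmeans_pseudo_labels(toy_features, num_clusters=2)
print(toy_labels)
```

In the toy frame, the two halves end up in different clusters. In the full method such pseudo labels would supervise mask encoding and decoding in step ii); that part is not sketched here.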

Citing MaskVOS

@inproceedings{li2023unified,
	title={Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation},
	author={Li, Liulei and Wang, Wenguan and Zhou, Tianfei and Li, Jianwu and Yang, Yi},
	booktitle={CVPR},
	year={2023}
}

Folders and files

configs
core
data/filelists
datasets
labelprop
launch
models
utils
.gitignore
README.md
base_trainer.py
env_run.sh
gen_mask.py
infer_mask.py
infer_vos.py
opts.py
run.sh
train.py
train_mask.py
train_quan.py
