High Quality Entity Segmentation (ICCV 2023 Oral)

Lu Qi, Jason Kuen, Tiancheng Shen, Jiuxiang Gu, Weidong Guo, Jiaya Jia, Zhe Lin, Ming-Hsuan Yang

This project offers an implementation of the paper, "High-Quality Entity Segmentation". This repository serves as an unofficial extension to the Adobe EntitySeg Github, where you can directly download the EntitySeg Dataset and the source code of our proposed CropFormer. For a more comprehensive view of our results and visualizations, we invite you to explore our project website.

News

2023-09-07 The dataset, code and pretrained models are released.

2023-08-01 Our paper is accepted as ICCV2023 oral.

Data

Please refer to the official repo EntitySeg-Dataset for annotation files and image URLs. For convenience, we provide the images in several links including Google Drive and Hugging Face, but we do not own the copyright of the images. It is solely your responsibility to check the original licenses of the images before using them. Any use of the images are at your own discretion and risk. Furthermore, please refer to the dataset description on how to set up the dataset before running our code.

Code

We offer the instructions on installation, train, evaluation and visualization for the proposed CropFormer in the code description.

Model Zoo

Overall, we also provide the pretrained segmentation models in two links including Google Drive or Hugging Face. We illustrate them in the following. For the class-aware segmentation tasks, we directly use the COCO-pretrained Mask2Former Model as our pretrain weights.

Entity Segmentation

We provide several entity segmentation models. For all the training, we use the COCO-Entity pretrained models as our initialization that are provided in Google Drive or Hugging Face. We evaluate the model on both the overlapped-free $AP$ and no-overlapped $AP^e$ in low-resolution (L) and high-resolution images (H).

Method	Backbone	Sched	AP_L	AP_L^e	AP_H	AP_H^e	download
Mask2Former	Swin-T	3x	-	38.8	-	40.7	model
CropFormer	Swin-T	3x	-	40.6	-	43.0	model
Mask2Former	Swin-L	3x	-	44.4	-	46.2	model
CropFormer	Swin-L	3x	-	45.8	-	48.2	model
Mask2Former	Hornet-L	3x	51.0	47.1	53.6	49.2	model
CropFormer	Hornet-L	3x	-	49.1	-	51.5	model

Instance Segmentation

Method	Backbone	Sched	AP_H	download
Mask2Former	Swin-T	3x	22.7	model
Mask2Former	Swin-L	3x	30.3	model

Semantic Segmentation

Method	Backbone	Sched	mIoU_H	download
Mask2Former	Swin-T	3x	45.2	model
Mask2Former	Swin-L	3x	51.1	model

Panoptic Segmentation

Method	Backbone	Sched	PQ_H	download
Mask2Former	Swin-T	3x	9.8	model
Mask2Former	Swin-L	3x	13.5	model

Citing Ours

Consider to cite High Quality Entity Segmentation if it helps your research.

@inproceedings{qi2022high,
  title={High Quality Entity Segmentation},
  author={Qi, Lu and Kuen, Jason and Shen, Tiancheng and Gu, Jiuxiang and Guo, Weidong and Jia, Jiaya and Lin, Zhe and Yang, Ming-Hsuan},
  booktitle={ICCV},
  year={2023}
}

License

The code and models are released under the CC BY-NC 4.0 license.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

High Quality Entity Segmentation (ICCV 2023 Oral)

News

Data

Code

Model Zoo

Entity Segmentation

Instance Segmentation

Semantic Segmentation

Panoptic Segmentation

Citing Ours

License

Files

README.md

Latest commit

History

README.md

File metadata and controls

High Quality Entity Segmentation (ICCV 2023 Oral)

News

Data

Code

Model Zoo

Entity Segmentation

Instance Segmentation

Semantic Segmentation

Panoptic Segmentation

Citing Ours

License