PCSformer

Implementation of the paper: "PCSformer: Pair-wise Cross-scale Sub-prototypes Mining with CNN-Transformers for Weakly Supervised Semantic Segmentation"

Abstract

Generating initial seeds is an important step in weakly supervised semantic segmentation. Our approach concentrates on generating and refining initial seeds. The convolutional neural networks (CNNs)--based initial seeds focus only on the most discriminative regions and lack global information about the target. The Vision Transformer (ViT)--based approach can capture long-range feature dependencies due to the unique advantage of the self-attention mechanism. Still, we find that it suffers from distractor object leakage and background leakage problems. Based on these observations, we propose PCSformer in this paper, which improves the model's ability to extract features through a Pair-wise Cross-scale (PC) strategy and solves the problem of distractor object leakage by further extracting potential target features through Sub-Prototypes (SP) mining. In addition, the proposed Conflict Self-Elimination (CSE) module further alleviates the background leakage problem. We validate our approach on the commonly used Pascal VOC 2012 and MS COCO 2014, and extensive experiments show that we achieve superior results. We also extend PCSformer to weakly supervised object localization tasks and perform well. In addition, our approach is competitive for semantic segmentation in medical images and challenging deformable and often translucent cluttered scenes. The code is available at https://github.com/ChunmengLiu1/PCSformer.

Prerequisite

1. install dependencies

Ubuntu 18.04, CUDA 11.4, Python 3.9.18, and the following Python dependencies.

pip install -r requirements.txt

Usage

1. cd PC_1, Run the run_pc_voc.sh script for training PCSformer in the Pair-wise Cross scale (PC) strategy stage

bash run_pc_voc.sh

2. cd SP_2, Run the run_sp_voc.sh script for training PCSformer in the Sub-prototype (SP) strategy stage

bash run_sp_voc.sh

3. Train semantic segmentation network

To train DeepLab-v2, we refer to deeplab-pytorch.

Testing

Download our trained weights

Stage	Backbone	Google drive	mIoU (%)
Initial seeds (after PC)	Conformer-S	Weights	66.4
Initial seeds (after SP)	Conformer-S	Weights	68.2
Final prediction (on VOC datasets)	ResNet101	Weights	72.8
Final prediction (on COCO datasets)	ResNet101	Weights	41.9

Acknowledgements

This code is borrowed from TransCAM, SC-CAM, and deeplab-pytorch.

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
PC_1		PC_1
SP_2		SP_2
imgs		imgs
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PCSformer

Abstract

Prerequisite

1. install dependencies

2. Download dataset

3. Download pre-trained weights and put them under the folder "weights"

4. Download saliency map

Usage

1. cd PC_1, Run the run_pc_voc.sh script for training PCSformer in the Pair-wise Cross scale (PC) strategy stage

2. cd SP_2, Run the run_sp_voc.sh script for training PCSformer in the Sub-prototype (SP) strategy stage

3. Train semantic segmentation network

Testing

Download our trained weights

Acknowledgements

About

Releases

Packages

Languages

ChunmengLiu1/PCSformer

Folders and files

Latest commit

History

Repository files navigation

PCSformer

Abstract

Prerequisite

1. install dependencies

2. Download dataset

3. Download pre-trained weights and put them under the folder "weights"

4. Download saliency map

Usage

1. cd PC_1, Run the run_pc_voc.sh script for training PCSformer in the Pair-wise Cross scale (PC) strategy stage

2. cd SP_2, Run the run_sp_voc.sh script for training PCSformer in the Sub-prototype (SP) strategy stage

3. Train semantic segmentation network

Testing

Download our trained weights

Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages