img.: image | vid.: video | 3d.: 3D | obj.: object detection | sem.: semantic segmentation | ins.: instance segmentation | pan.: panoptic segmentation

- [NeurIPS] GLIPv2: Unifying Localization and Vision-Language Understanding. [pytorch] [img., obj.]
- [NeurIPS] Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection. [pytorch] [img., obj.]
- [ECCV] Open Vocabulary Object Detection with Pseudo Bounding-Box Labels. [pytorch] [img., obj.]
- [ECCV] Exploiting Unlabeled Data with Vision and Language Models for Object Detection. [pytorch] [img., obj.]
- [ECCV] Simple Open-Vocabulary Object Detection with Vision Transformers. [jax] [img., obj.]
- [ECCV] Open-Vocabulary DETR with Conditional Matching. [pytorch] [img., obj.]
- [ECCV] PromptDet: Towards Open-Vocabulary Detection Using Uncurated Images. [pytorch] [img., obj.]
- [ECCV] A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-Language Model. [pytorch] [img., sem.]
- [ECCV] Scaling Open-Vocabulary Image Segmentation with Image-Level Labels. [img., sem.]
- [CVPR] Learning To Prompt for Open-Vocabulary Object Detection With Vision-Language Model. [pytorch] [img., obj.]
- [CVPR] Grounded Language-Image Pre-training. [pytorch] [img., obj.]
- [CVPR] Open-Vocabulary One-Stage Detection With Hierarchical Visual-Language Knowledge Distillation. [pytorch] [img., obj.]
- [CVPR] RegionCLIP: Region-Based Language-Image Pretraining. [pytorch] [img., obj.]
- [CVPR] Open-Vocabulary Instance Segmentation via Robust Cross-Modal Pseudo-Labeling. [pytorch] [img., ins.]
- [ACM MM] Rethinking Open-World Object Detection in Autonomous Driving Scenarios. [img., obj.]
- [GCPR] Localized Vision-Language Matching for Open-vocabulary Object Detection. [pytorch] [img., obj.]
- [TPAMI] Learning to Overcome Noise in Weak Caption Supervision for Object Detection. [img., obj.]
- [arXiv] P3OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection. [img., obj.]
- [arXiv] F-VLM: Open-Vocabulary Object Detection upon Frozen Vision and Language Models. [img., obj.]
- [arXiv] Open Vocabulary Object Detection with Proposal Mining and Prediction Equalization. [pytorch] [img., obj.]
- [arXiv] Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models. [pytorch] [img., obj.]
- [arXiv] Learning Object-Language Alignments for Open-Vocabulary Object Detection. [pytorch] [img., obj.]
- [arXiv] Open-Vocabulary Panoptic Segmentation with MaskCLIP. [img., pan.]
- [arXiv] Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP. [pytorch] [img., sem.]
- [arXiv] Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning. [img., sem.]
- [arXiv] Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation. [pytorch] [img., ins.]
- [arXiv] Open-Vocabulary 3D Detection via Image-level Class and Debiased Cross-modal Contrastive Learning. [3d., obj.]