-
Notifications
You must be signed in to change notification settings - Fork 2
Issues: YoojLee/paper_review
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Sonic: Shifting Focus to Global Audio Perception in Portrait Animation
2024
papers published in 2024
diffusion
diffusion models
video
#97
opened Jan 24, 2025 by
YoojLee
Adding Conditional Control to Text-to-Image Diffusion Models
2024
papers published in 2024
#96
opened Jan 16, 2025 by
YoojLee
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
2024
papers published in 2024
#95
opened Jan 1, 2025 by
YoojLee
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment
2024
papers published in 2024
FAIR
papers from Facebook AI Research
VLP
vision-language pre-training
#94
opened Dec 24, 2024 by
YoojLee
InstantID: Zero-shot Identity-Preserving Generation in Seconds
2024
papers published in 2024
#93
opened Nov 20, 2024 by
YoojLee
Continuous Memory Representation for Anomaly Detection
2024
papers published in 2024
AD
Anomaly Detection
ECCV
papers published at ECCV
#91
opened Sep 2, 2024 by
YoojLee
LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION
2024
papers published in 2024
ICLR
papers published at ICLR
LMM
Large multimodal models
#90
opened Aug 30, 2024 by
YoojLee
Learning to Detect Multi-class Anomalies with Just One Normal Image Prompt
2024
papers published in 2024
AD
Anomaly Detection
ECCV
papers published at ECCV
#89
opened Aug 26, 2024 by
YoojLee
Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection
2024
papers published in 2024
AD
Anomaly Detection
ECCV
papers published at ECCV
#87
opened Aug 23, 2024 by
YoojLee
Dinomaly: The Less Is More Philosophy in Multi-Class Unsupervised Anomaly Detection
2024
papers published in 2024
AD
Anomaly Detection
#85
opened Jul 17, 2024 by
YoojLee
Read-only Prompt Optimization for Vision-Language Few-shot Learning (2023)
2023
papers published in 2023
few-shot
ICCV
papers published in ICCV
multimodal
papers regarding or leveraging multimodal representation
Prompt Tuning
papers regarding prompt tuning
#84
opened Apr 29, 2024 by
YoojLee
PromptAD: Learning Prompts with only Normal Samples for Few-Shot Anomaly Detection (2024)
2024
papers published in 2024
AD
Anomaly Detection
CVPR
papers published at CVPR
Prompt Tuning
papers regarding prompt tuning
#83
opened Apr 9, 2024 by
YoojLee
Unified-IO 2: Scaling AutoRegressive Multimodal Models with Vision, Language, Audio, and Action (2024)
2024
papers published in 2024
AI2
papers from Allen Institute for AI
CVPR
papers published at CVPR
foundation models
general framework
LMM
Large multimodal models
WIP
work in progress
#82
opened Mar 14, 2024 by
YoojLee
Aligning Bag of Regions for Open-Vocabulary Object Detection (2023)
CVPR
papers published at CVPR
OD
object detection
Open-vocab
Open Vocabulary Learning
#80
opened Mar 4, 2024 by
YoojLee
Beyond Dents and Scratches: Logical Constraints in Unsupervised Anomaly Detection and Localization (2022)
2022
papers publisehd in 2022
AD
Anomaly Detection
IJCV
papers published on International Journal of Computer Vision
#79
opened Feb 27, 2024 by
YoojLee
PromptAD: Zero-shot Anomaly Detection using Text Prompts (2024)
2024
papers published in 2024
AD
Anomaly Detection
Prompt Tuning
papers regarding prompt tuning
WACV
papers published at WACV
WIP
work in progress
ZSL
zeroshot learning
#77
opened Feb 12, 2024 by
YoojLee
Mitigating Hallucination in Large Multi-Modal Models via Robust Instruction Tuning (2024)
2024
papers published in 2024
ICLR
papers published at ICLR
Instruction Tuning
papers regarding Instruction tuning or prompt engineering
LMM
Large multimodal models
multimodal
papers regarding or leveraging multimodal representation
#76
opened Feb 7, 2024 by
YoojLee
CogVLM: Visual Expert for Pretrained Language Models (2024)
2024
papers published in 2024
Arxiv
Arxiv preprint
multimodal
papers regarding or leveraging multimodal representation
VLP
vision-language pre-training
#75
opened Feb 6, 2024 by
YoojLee
Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning (2023)
2023
papers published in 2023
data-driven
dataset construction or sth
ICCV
papers published in ICCV
multimodal
papers regarding or leveraging multimodal representation
#73
opened Feb 5, 2024 by
YoojLee
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.