CLIP_VisualPrompting

Unofficial Implementation of the paper What does CLIP know about a red circle? Visual prompt engineering for VLMs

prerequisites: Installation following CLIP repo.

Usage

We have the example of the pan_1.png and pan_2.png, and match them to texts ["an image of the handle of a pan", "an image of the cooking area of a pan"]. After running the script, we have a probability of [[0.6423, 0.3527], [0.3517, 0.6433]] as the final scores.

Acknowledgement

We borrow the optimal transport function from SuperGlue

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
main.py		main.py
pan_1.png		pan_1.png
pan_2.png		pan_2.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

CLIP_VisualPrompting

Unofficial Implementation of the paper What does CLIP know about a red circle? Visual prompt engineering for VLMs

prerequisites: Installation following CLIP repo.

Usage

Acknowledgement

About

Releases

Packages

Languages

Seasandwpy/CLIP_VisualPrompting

Folders and files

Latest commit

History

Repository files navigation

CLIP_VisualPrompting

Unofficial Implementation of the paper What does CLIP know about a red circle? Visual prompt engineering for VLMs

prerequisites: Installation following CLIP repo.

Usage

Acknowledgement

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages