diff --git a/README.md b/README.md index d6bb3a4..f3e7095 100644 --- a/README.md +++ b/README.md @@ -1,13 +1,27 @@ -# Understanding and Benchmarking Zero-Shot Adversarial Robustness for Foundation Models - -Pretrained large-scale vision-language models like CLIP have exhibited strong generalization over unseen tasks, yet imperceptible adversarial perturbations can significantly reduce their performance. Standard - +# Understanding and Benchmarking Zero-Shot Adversarial Robustness for Foundation Models (ICLR 2023) + +

+

+ Chengzhi Mao* + · + Scott Geng* + · + Junfeng Yang + · + Xin Wang + · + Carl Vondrick

+

https://arxiv.org/abs/2212.07016

+

+ +Pretrained vision-language foundation models like CLIP have exhibited strong generalization over unseen tasks, yet imperceptible adversarial perturbations can significantly reduce their performance. Given that such large-scale models are becoming a form of infrastructure in practice, understanding and defending their robustness has become an important new problem space. In particular, our recent work demonstrates that existing standard adversarial training techniques suffer from a catch-22 when applied to zero-shot models: without adversarial training, the model is vulnerable to attacks, but with adversarial training, the model loses its zero-shot capabilities. This problem is partially addressed by our introduced text-guided contrastive adversarial training loss, but a gap still remains. To spur further advances in this important space, we propose a defence challenge. ## Zero-Shot Adversarial Robustness Challenge | Defence Method | Submitted By | Accuracy
(Robust) | Accuracy
(Clean) | Submission Date | |---------------- |----------------- |---------------- |----------------- |----------------- | -| TeCoA with Finetuning | (initial entry) | **38.18%** | 55.97% | Mar 1, 2023 | +| TeCoA w/ Finetuning | (initial entry) | **38.18%** | 55.97% | Mar 1, 2023 | +| Standard Adv. Training w/ Finetuning | (initial entry) | 10.62% | 18.49% | Mar 1, 2023 | | Vanilla CLIP-B/32 (no defence) | (initial entry) | 6.57 | **64.56%** | Mar 1, 2023 |