Leaks-in-ML-DS-prevention/cases/adversarial_prompt_attacks.md at master · GrigoriiTarasov/Leaks-in-ML-DS-prevention · GitHub

Effect

Overesteemed results.

Rules to avoid leakage

Use ROUGE-L score instead of cosine similarity

Symptom

Cosine similarity is used e.g. for prompt recovery quality estimation

Incorporation stage

ML task setting: metric choice for model scoring

Was met or loosely based on

kaggle "LLM Prompt Recovery" competition KHOI NGUYEN solution