Skip to content

Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?

Notifications You must be signed in to change notification settings

CarlanLark/Robust-AIGC-Detector

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Robust-AIGC-Detector

Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?

Environments

torch==1.11.0
transformers==4.30.2
textattack==0.3.9 
tensorflow==2.9.1 
tensorflow_hub==0.15.0

Data Preparation

unzip data_in.zip
mkdir data_out

Training

$ bash train.sh

Checkpoints

The checkpoints of in-domain detector, cross-domain detector, and cross-genre detector can be found in https://huggingface.co/CarlanLark/AIGT-detector-in-domain. (These detectors are trained on the same training set and evaluated on different test sets.)

The checkpoint of mixed-source detector can be found in https://huggingface.co/CarlanLark/AIGT-detector-mixed-source.

Robustness Evaluation

$ bash attack.sh

Citation

If you find our work useful to your research, you can cite the paper below:

@article{huang2024ai,
  title={Are AI-Generated Text Detectors Robust to Adversarial Perturbations?},
  author={Huang, Guanhua and Zhang, Yuchen and Li, Zhe and You, Yongjian and Wang, Mingze and Yang, Zhouwang},
  journal={arXiv preprint arXiv:2406.01179},
  year={2024}
}

About

Code for ACL 2024 long paper: Are AI-Generated Text Detectors Robust to Adversarial Perturbations?

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published