ahnpersie's picture
Update README.md
579dc0a verified
metadata
base_model: meta-llama/Meta-Llama-3.1-8B-Instruct
language:
  - en
library_name: peft
license: llama3.1
pipeline_tag: text2text-generation

LLaMA-3.1-8B-LoRA-COCO-Deceptive-CLIP Model Card

🏆 This work is accepted to ACL 2025 (Main Conference).

main result Figure: Attack success rate (ASR) and caption diversity of our model on the COCO dataset, illustrating its ability to generate deceptive captions that successfully fool CLIP.

Model Description

Model Details

  • Model: LLaMA-3.1-8B-LoRA-COCO-Deceptive-CLIP is a deceptive caption generator built on LLaMA-3.1-8B, fine-tuned using LoRA (i.e., self-training, or more specifically, rejection sampling fine-tuning (RFT)) to deceive CLIP on the COCO dataset. It achieves an attack success rate (ASR) of 42.1%.
  • Architecture: This model is based on LLaMA-3.1-8B and utilizes PEFT v0.12.0 for efficient fine-tuning.

How to Use

See our GitHub repository for full usage instructions and scripts.