Rewrite to Jailbreak (R2J)

This is the official repository for the ACL 2025 Findings paper "Rewrite to Jailbreak: Discover Learnable and Transferable Implicit Harmfulness Instruction".

The code is available at https://github.com/ythuang02/R2J

Citation

@article{huang2025rewrite,
  title={Rewrite to Jailbreak: Discover Learnable and Transferable Implicit Harmfulness Instruction},
  author={Huang, Yuting and Liu, Chengyuan and Feng, Yifeng and Wu, Yiquan and Wu, Chao and Wu, Fei and Kuang, Kun},
  journal={arXiv preprint arXiv:2502.11084},
  year={2025}
}
