Tiny Refinements Elicit Resilience: Toward Efficient Prefix-Model Against LLM Red-Teaming
Published in arXiv preprint arXiv:2405.12604, 2024
Recommended citation: Jiaxu Liu, Xiangyu Yin, Sihao Wu, Jianhong Wang, Meng Fang, Xinping Yi, and Xiaowei Huang. (2024). "Tiny Refinements Elicit Resilience: Toward Efficient Prefix-Model Against LLM Red-Teaming." arXiv:2405.12604.
