Virus
PublicThis is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"
Creat:2025-01-09T00:02:12
Update:2025-02-27T22:57:32
https://arxiv.org/pdf/2501.17433
50
Stars
0
Stars Increase