ALERT
PublicOfficial repository for the paper "ALERT: A Comprehensive Benchmark for Assessing Large Language Models’ Safety through Red Teaming"
aiartificial-intelligencebenchmarkbias-detectionllmllm-evaluationllm-safetyllm-safety-benchmarknlpnlp-machine-learning
Creat:2024-04-06T19:01:51
Update:2025-03-22T20:00:34
https://arxiv.org/abs/2404.08676
44
Stars
0
Stars Increase