ALaRM
Public[ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"
Creat:2024-03-12T15:30:52
Update:2024-11-17T23:28:06
https://alarm-fdu.github.io/
25
Stars
0
Stars Increase
[ACL 2024] Code for the paper "ALaRM: Align Language Models via Hierarchical Rewards Modeling"