SimPO
Public[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward
Creat:2024-05-22T04:13:33
Update:2025-03-25T21:27:38
912
Stars
0
Stars Increase
[NeurIPS 2024] SimPO: Simple Preference Optimization with a Reference-Free Reward