beta-DPO
Public[NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$
Creat:2024-05-22T16:17:20
Update:2025-02-26T17:38:41
45
Stars
0
Stars Increase
[NeurIPS 2024] Official code of $\beta$-DPO: Direct Preference Optimization with Dynamic $\beta$