awesome-direct-preference-optimization
PublicA Survey of Direct Preference Optimization
alignmentdirect-preference-optimizationlarge-language-modellarge-language-modelsllmllmsreinforcement-learning-from-human-feedback
Creat:2024-11-26T19:22:04
Update:2025-03-24T14:26:09
73
Stars
0
Stars Increase