OpenRLHF
PublicAn Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
Creat:2023-07-30T10:20:13
Update:2025-03-27T11:00:25
https://openrlhf.readthedocs.io/
7.5K
Stars
23
Stars Increase