OpenRLHF

Public

An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)

Created: 2023-07-30T10:20:13
Updated: 2025-03-27T11:00:25
https://openrlhf.readthedocs.io/
Stars: 8.6K
Stars increase: 24