instructGOOSE
PublicImplementation of Reinforcement Learning from Human Feedback (RLHF)
Creat:2022-12-28T14:39:51
Update:2025-03-22T00:11:15
https://xrsrke.github.io/instructGOOSE/
171
Stars
0
Stars Increase
Implementation of Reinforcement Learning from Human Feedback (RLHF)