AIbase

RLHF-Reward-Modeling

Public

Recipes to train reward model for RLHF.

Creat2024-03-21T13:13:27
Update2025-03-26T23:15:03
https://rlhflow.github.io/
1.4K
Stars
1
Stars Increase

Related projects