AIbase

RLHF-Reward-Modeling

Public

Recipes to train reward model for RLHF.

作成時間2024-03-21T13:13:27
更新時間2025-03-26T23:15:03
https://rlhflow.github.io/
1.4K
Stars
2
Stars Increase

関連プロジェクト