DMoERM

Public

[ACL 2024 Findings] DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling

Created: 2024-03-01T15:52:47
Updated: 2024-12-19T09:03:26
https://arxiv.org/abs/2403.01197
Stars: 16
Stars Increase: 0