AIbase
Product LibraryTool NavigationMCP

slic-hf

Public

Experiments of divergence functions for DPO, RLHF

Creat2023-12-08T01:27:50
Update2025-02-28T21:48:37
https://arxiv.org/abs/2309.16240
7
Stars
0
Stars Increase