HomeAI Tutorial

HALOs

Public

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Creat2023-12-03T15:53:36
Update2023-06-06T21:07:59
https://arxiv.org/abs/2402.01306
894
Stars
0
Stars Increase