AIbase

MA-RLHF

Public

[ICLR'25] MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Creat2024-09-27T11:48:28
Update2025-03-09T03:23:44
https://openreview.net/forum?id=WWXjMYZxfH
7
Stars
0
Stars Increase