MA-RLHF
Public[ICLR'25] MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Creat:2024-09-27T11:48:28
Update:2025-03-09T03:23:44
https://openreview.net/forum?id=WWXjMYZxfH
7
Stars
0
Stars Increase
[ICLR'25] MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions