MA-RLHF
Public repository: [ICLR'25] MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
Created: 2024-09-27T11:48:28
Updated: 2025-03-09T03:23:44
https://openreview.net/forum?id=WWXjMYZxfH
Stars: 7
Stars Increase: 0