AC-Solver
A long-horizon, sparse-reward math environment for reinforcement learning. Official code repository for "What makes Math problems hard for reinforcement learning: A case study".
Created: 2024-08-17T02:46:54
Updated: 2025-03-26T14:09:45
https://arxiv.org/abs/2408.15332
Stars: 29
Stars increase: 0