AC-Solver
PublicA long-horizon, sparse-reward math environment for reinforcement learning. Official code repo for "What makes Math problems hard for reinforcement learning: A case study".
Hora de criação:2024-08-17T02:46:54
Hora de atualização:2025-03-26T14:09:45
https://arxiv.org/abs/2408.15332
29
Stars
0
Stars Increase