LLM-Reverse-Curriculum-RL
PublicImplementation of the ICML 2024 paper "Training Large Language Models for Reasoning through Reverse Curriculum Reinforcement Learning" presented by Zhiheng Xi et al.
Creat:2024-02-08T23:28:31
Update:2025-03-25T23:16:35
https://arxiv.org/abs/2402.05808
107
Stars
0
Stars Increase