rent-rl
RENT (Reinforcement Learning via Entropy Minimization) is an unsupervised method for training reasoning LLMs.
Created: 2025-05-28T22:46:25
Updated: 2025-06-14T15:08:06
https://rent-rl.github.io
Stars: 33
Stars Increase: 0
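To make the phrase "entropy minimization" concrete, here is a minimal sketch of how an entropy-based reward for a generated response could be computed. It is an illustration under stated assumptions, not code from the RENT repository: the function names (`softmax`, `token_entropy`, `entropy_reward`) are hypothetical, and averaging the entropy over all generated tokens is an assumption made for simplicity.

```python
import math

def softmax(logits):
    """Convert raw logits into a probability distribution (numerically stable)."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def token_entropy(logits):
    """Shannon entropy (in nats) of the next-token distribution at one position."""
    probs = softmax(logits)
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def entropy_reward(per_token_logits):
    """Hypothetical reward: negative mean token entropy over a response.

    Lower entropy (a more confident model) yields a higher reward, so
    maximizing this reward minimizes entropy. No labels are needed, which
    is what makes the signal unsupervised.
    """
    entropies = [token_entropy(logits) for logits in per_token_logits]
    return -sum(entropies) / len(entropies)

# Toy usage with a 4-token vocabulary: a peaked distribution earns a
# higher reward than a uniform one.
confident = [[10.0, 0.0, 0.0, 0.0]]
uncertain = [[0.0, 0.0, 0.0, 0.0]]
print(entropy_reward(confident) > entropy_reward(uncertain))
```

In an actual training loop the logits would come from the language model being trained and the reward would feed a policy-gradient update; both parts are omitted here.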