NFT
Public implementation of the Negative-aware Fine-Tuning (NFT) algorithm from "Bridging Supervised Learning and Reinforcement Learning in Math Reasoning"
Created: 2025-06-11T19:03:11
Updated: 2025-06-19T10:35:58
https://research.nvidia.com/labs/dir/Negative-aware-Fine-Tuning/
Stars: 27
Stars increase: 0