AIbase

Implementation of Negative-aware Finetuning (NFT) algorithm for "Bridging Supervised Learning and Reinforcement Learning in Math Reasoning"

Creat2025-06-11T19:03:11
Update2025-06-19T10:35:58
https://research.nvidia.com/labs/dir/Negative-aware-Fine-Tuning/
27
Stars
0
Stars Increase