HomeAI Tutorial

VeriGenLLM-v2

Public

This project repo is for an ongoing research in RLFT. Note: Experiment still under development (unstable)

Creat2025-07-24T14:05:24
Update2025-09-01T13:02:13
https://wandb.ai/noobsiecoder-northeastern-university/llm-rlft-GRPO/reports/RLFT-using-PPO-and-GRPO-on-small-dataset--VmlldzoxNDIwNjkyMQ
0
Stars
0
Stars Increase