Fine-tuning-Flan-T5-RLHF
PublicAligning FLAN-T5 with Reinforcement Learning from Human Feedback (RLHF) for Neutral, Grammatically Correct News Summaries
Creat:2025-07-20T01:28:11
Update:2025-07-20T05:20:12
0
Stars
0
Stars Increase
Aligning FLAN-T5 with Reinforcement Learning from Human Feedback (RLHF) for Neutral, Grammatically Correct News Summaries