Fine-tuning-Flan-T5-RLHF

Public

Aligning FLAN-T5 with Reinforcement Learning from Human Feedback (RLHF) for Neutral, Grammatically Correct News Summaries

fine-tuning flan-t5 huggingface nlp reinforcement-learning rlhf summarization transformers trl

Creat：2025-07-20T01:28:11

Update：2025-07-20T05:20:12

Stars

Stars Increase

Related projects

LLaMA Factory

Hot

agent

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

63700

10个月前

+147today

Yolov5

coreml

YOLOv5 ? in PyTorch > ONNX > CoreML > TFLite

56301

1年前

+25today

Unsloth

Hot

deepseek

Finetune Llama 3.3, DeepSeek-R1 & Reasoning LLMs 2x faster with 70% less memory! ?

49117

10个月前

+115today

Llama_index

Hot

agents

LlamaIndex is the leading framework for building LLM-powered agents over your data.

45704

10个月前

+65today

Sheetjs

angular

? SheetJS Spreadsheet Data Toolkit -- New home https://git.sheetjs.com/SheetJS/sheetjs

36074

2年前

+4today

AgentGPT

agent

? Assemble, configure, and deploy autonomous AI Agents in your browser.

35316

10个月前

+16today

Rufus

bios

The Reliable USB Formatting Utility

33918

10个月前

+43today

Xray Core

Hot

anticensorship

Xray, Penetrates Everything. Also the best v2ray-core. Where the magic happens.

33223

10个月前

+112today

Self Llm

Hot

chatglm

《开源大模型食用指南》针对中国宝宝量身打造的基于Linux环境快速微调（全参数/Lora）、部署国内外开源大模型（LLM）/多模态大模型（MLLM）教程

26451

10个月前

+75today

Libgdx

Desktop/Android/HTML5/iOS Java game development framework

24589

10个月前

+14today

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Brand Visibility

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Services

AI Model Compatibility Checker

AI Deployment Calculator

Fine-tuning-Flan-T5-RLHF

Related projects

LLaMA Factory

Yolov5

Unsloth

Llama_index

Sheetjs

AgentGPT

Rufus

Xray Core

Self Llm

Libgdx

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Brand Visibility

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Services​

AI Model Compatibility Checker

AI Deployment Calculator

Fine-tuning-Flan-T5-RLHF

Related projects

LLaMA Factory

Yolov5

Unsloth

Llama_index

Sheetjs

AgentGPT

Rufus

Xray Core

Self Llm

Libgdx

GEO Services