rlhf-trl

Public

Reinforcement Learning from Human Feedback with 🤗 TRL

human-feedback reinforcment-learning rlhf

Created: 2023-06-10T23:16:02

Updated: 2025-03-23T22:12:20

Related projects

LLaMA Factory

agent

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

47790

1 month ago

+169 today

Open Assistant

OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.

37331

1 month ago

+7 today

LLMSurvey

chain-of-thought

The official GitHub page for the survey paper "A Survey of Large Language Models".

11414

1 month ago

+5 today

PaLM Rlhf Pytorch

artificial-intelligence

Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM

7792

1 month ago

+1 today

Chinese LLaMA Alpaca 2

64k

Phase 2 of the Chinese LLaMA-2 & Alpaca-2 LLM project, plus 64K long-context models

7158

1 month ago

InternLM

chatbot

Official release of InternLM series (InternLM, InternLM2, InternLM2.5, InternLM3).

6883

1 month ago

+2 today

Alignment Handbook

llm

Robust recipes to align language models with human and AI preferences

5150

1 month ago

+6 today

Awesome RLHF

deep-learning

A curated list of reinforcement learning with human feedback resources (continually updated)

3907

1 month ago

+5 today

ChatGLM Efficient Tuning

alpaca

Efficient fine-tuning of ChatGLM-6B with PEFT

3699

1 month ago

Align Anything

chameleon

Align Anything: Training All-modality Model with Feedback

3511

1 month ago

+27 today

