MA-RLHF

Public

[ICLR'25] MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

llm-training ma-rlhf ppo rlhf

Heure de création：2024-09-27T11:48:28

Heure de mise à jour：2025-03-09T03:23:44

https://openreview.net/forum?id=WWXjMYZxfH

Stars

Stars Increase

Projets connexes

Dify

Hot

agent

Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.

107430

3个月前

+182today

Gpt4all

ai-chat

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.

73833

3个月前

+8today

Browser Use

Hot

ai-agents

Make websites accessible for AI agents

65729

4个月前

+77today

LLMs From Scratch

Hot

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

59176

10个月前

+91today

MetaGPT

agent

? The Multi-Agent Framework: First AI Software Company, Towards Natural Language Programming

57286

3个月前

+24today

LLaMA Factory

Hot

agent

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

54488

3个月前

+90today

Vllm

Hot

amd

A high-throughput and memory-efficient inference and serving engine for LLMs

52538

1年前

+95today

Autogen

Hot

agentic

A programming framework for agentic AI ? PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour

47454

1年前

+65today