safe-rlhf
Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Created: 2023-05-15T19:47:08
Updated: 2025-03-27T11:11:18
https://pku-beaver.github.io
Stars: 1.5K