robot-rlhf

Public

Robot Learning from Human Feedback. Inspired by advancements in NLP, we train a robot policy via reinforcement learning using a reward function learned exclusively from human preferences.

alignment chatgpt reinforcement-learning rlhf robotics

Created: 2023-04-16T10:39:45

Updated: 2024-10-25T10:44:16
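
The description above mentions a reward function learned from human preferences. This listing does not say how the repository implements that, so the following is only a minimal sketch of one common approach: a Bradley-Terry preference model fitted with a cross-entropy loss. Every name here (`train_reward`, `segment_return`, the linear reward model, the synthetic "human" labels generated from `TRUE_W`) is a hypothetical stand-in for illustration and is not taken from the project.

```python
import math
import random


def sigmoid(x):
    # Numerically stable logistic function.
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    z = math.exp(x)
    return z / (1.0 + z)


def feature_sum(segment, dim):
    # Sum per-step feature vectors over a trajectory segment.
    return [sum(step[i] for step in segment) for i in range(dim)]


def segment_return(w, segment):
    # Assumption: a linear reward r(s) = w . phi(s), summed over the segment.
    # A real system would typically use a neural network instead.
    return sum(wi * fi for wi, fi in zip(w, feature_sum(segment, len(w))))


def train_reward(pairs, dim, lr=0.1, epochs=200):
    # pairs: (segment_a, segment_b, label), label 1.0 if a was preferred, else 0.0.
    w = [0.0] * dim
    for _ in range(epochs):
        for seg_a, seg_b, label in pairs:
            # Bradley-Terry: P(a preferred over b) = sigmoid(R(a) - R(b)).
            p = sigmoid(segment_return(w, seg_a) - segment_return(w, seg_b))
            fa, fb = feature_sum(seg_a, dim), feature_sum(seg_b, dim)
            # Gradient of the cross-entropy loss with respect to w.
            for i in range(dim):
                w[i] -= lr * (p - label) * (fa[i] - fb[i])
    return w


# Synthetic preferences: a hidden "true" reward plays the role of the human labeler.
rng = random.Random(0)
TRUE_W = [1.0, -1.0]


def random_segment(steps=3, dim=2):
    return [[rng.uniform(-1, 1) for _ in range(dim)] for _ in range(steps)]


pairs = []
for _ in range(50):
    a, b = random_segment(), random_segment()
    label = 1.0 if segment_return(TRUE_W, a) > segment_return(TRUE_W, b) else 0.0
    pairs.append((a, b, label))

learned_w = train_reward(pairs, dim=2)
correct = sum(
    (segment_return(learned_w, a) > segment_return(learned_w, b)) == (label == 1.0)
    for a, b, label in pairs
)
accuracy = correct / len(pairs)
print(learned_w, accuracy)
```

In a full pipeline, the learned reward would then replace a hand-written reward inside a standard reinforcement learning loop; that step is omitted here.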


Related projects

Tensorflow

deep-learning

An Open Source Machine Learning Framework for Everyone

190776

1 year ago

+12 today

Stable Diffusion Webui

Stable Diffusion web UI

154604

1 year ago

+45 today

Transformers

Hot

bert

Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

147095

1 year ago

+57 today

30 Seconds Of Code

astro

Coding articles to level up your development skills

124596

3 months ago

+13 today

Generative Ai For Beginners

Hot

21 Lessons, Get Started Building with Generative AI. https://microsoft.github.io/generative-ai-for-beginners/

91941

5 months ago

+51 today

Pytorch

autograd

Tensors and Dynamic neural networks in Python with strong GPU acceleration

91575

3 months ago

+10 today

NextChat

Opencv

c-plus-plus

Open Source Computer Vision Library

83082

7 years ago

+15 today

Netdata

alerting

X-Ray Vision for your infrastructure!

75124

3 months ago

+14 today

D2l Zh

book

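Dive into Deep Learning (《动手学深度学习》): written for Chinese readers, runnable, and open to discussion. The Chinese and English editions are used for teaching at more than 500 universities in over 70 countries.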

70807

3 months ago

+33 today

