Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
List of Computer Science courses with video lectures.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
OpenAssistant is a chat-based assistant that understands tasks, can interact with third-party systems, and retrieve information dynamically to do so.
Python sample codes and textbook for robotics algorithms.
Conversational RPA SDK for Chatbot Makers. Join our Discord: https://discord.gg/7q8NBZbQzt
Bullet Physics SDK: real-time collision detection and multi-physics simulation for VR, games, visual effects, robotics, machine learning etc.
ArduPlane, ArduCopter, ArduRover, ArduSub source
强化学习中文教程(蘑菇书?),在线阅读地址:https://datawhalechina.github.io/easy-rl/
The official GitHub page for the survey paper "A Survey of Large Language Models".
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.