PPO-LunarLander

Public

A reinforcement learning agent built with PyTorch that uses Proximal Policy Optimization (PPO) to land a rocket in the LunarLander-v2 environment

actor-critic-algorithm lunar-lander openai-gym-environments ppo pytorch reinforcement-learning

Created: 2025-04-07T22:27:39

Updated: 2025-04-08T06:37:22
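
The listing does not show the repository's code, so the following is only a generic sketch of the clipped surrogate objective that PPO is named for, not the author's implementation. It is written in plain Python rather than PyTorch to stay self-contained; the function name `ppo_clip_loss` and the default `clip_eps=0.2` are assumptions made for illustration.

```python
import math


def ppo_clip_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Return the negative mean clipped surrogate objective for a batch.

    new_log_probs / old_log_probs: log-probabilities of the taken actions
    under the current policy and the policy that collected the data.
    advantages: advantage estimates for the same actions.
    clip_eps: hypothetical default; limits how far the ratio may move from 1.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio between the current and the data-collecting policy.
        ratio = math.exp(new_lp - old_lp)
        # Clamp the ratio to the interval [1 - clip_eps, 1 + clip_eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the pessimistic (smaller) of the two surrogate terms.
        total += min(ratio * adv, clipped * adv)
    # Negate so that minimizing the loss maximizes the objective.
    return -total / len(advantages)
```

In a PyTorch version the same arithmetic would typically be applied to tensors so that gradients flow through `new_log_probs`; the critic's value loss and any entropy bonus are left out of this sketch.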

