cleanrl
PublicHigh-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
a2cactor-criticadvantage-actor-criticaleatarideep-learningdeep-reinforcement-learninggymmachine-learningphasic-policy-gradient
Creat:2019-06-08T00:31:50
Update:2025-03-27T09:44:34
http://docs.cleanrl.dev
7.6K
Stars
6
Stars Increase