PyTorch-RL
PublicPyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.
a2cdeep-reinforcement-learningfisher-vectorsgenerative-adversarial-networkpolicy-gradientppoproximal-policy-optimizationpytorchpytorch-rlreinforcement-learning
Creat:2017-10-17T23:50:29
Update:2025-03-24T23:04:27
1.2K
Stars
0
Stars Increase