pytorch-a2c-ppo-acktr-gail
PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR) and Generative Adversarial Imitation Learning (GAIL).
a2c, acktr, actor-critic, advantage-actor-critic, ale, atari, continuous-control, deep-learning, deep-reinforcement-learning, hessian
Created: 2017-08-22T23:57:25
Updated: 2025-03-26T23:46:36
Stars: 3.8K
Stars Increase: 1