TD3-Bipedal-Walker
PublicTrains an agent with Twin Delayed Deep Deterministic Policy Gradient (TD3) to solve the Bipedal Walker challenge from OpenAI
Creat:2021-12-31T03:02:37
Update:2024-10-04T00:38:23
12
Stars
0
Stars Increase
Trains an agent with Twin Delayed Deep Deterministic Policy Gradient (TD3) to solve the Bipedal Walker challenge from OpenAI