Proximal-Policy-Optimization-PPO-for-BipedalWalker-v3
Publichis repository contains an implementation of the Proximal Policy Optimization (PPO) algorithm to solve the BipedalWalker-v3 environment from the Gymnasium library. This project uses a combination of policy and value networks to learn a policy for controlling a bipedal walker.
Creat:2025-03-07T13:07:53
Update:2025-03-10T21:58:48
1
Stars
0
Stars Increase