Home
Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

AI Tutorial

BipedalWalker-RL

Public

This project implements agent training using the Proximal Policy Optimization (PPO) algorithm in the BipedalWalker-v3 environment at two difficulty levels: normal and hardcore. The model's performance is evaluated based on rewards collected during the training process.

Creat2024-09-24T20:56:42
Update2025-01-07T17:46:47
https://gymnasium.farama.org/environments/box2d/bipedal_walker/
0
Stars
0
Stars Increase

Related projects