armed-bandit

Public

Solving n-armed bandit problems using different policies to find the one that incurs the least regret. The policies used in this project are policy gradient and Thompson sampling. All environments and agents are implemented with the aid of the Amalearn library. The project was carried out as part of the master's-level Reinforcement Learning course offered at the University of Tehran, under the supervision of Prof. Nili.
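To make the Thompson sampling policy mentioned above concrete, here is a minimal sketch for a Bernoulli bandit. It is not the project's code and does not use Amalearn (whose API is not shown here); it relies only on the Python standard library. The function name `thompson_sampling`, the parameter names, and the arm probabilities are hypothetical, chosen for illustration. Regret is measured here as the accumulated gap between the best arm's expected reward and the chosen arm's expected reward, which is one common definition.

```python
import random


def thompson_sampling(true_probs, n_steps, seed=0):
    """Beta-Bernoulli Thompson sampling (illustrative sketch, not the project's code).

    true_probs: hypothetical success probability of each arm.
    Returns (pull counts per arm, cumulative expected regret).
    """
    rng = random.Random(seed)
    n_arms = len(true_probs)
    successes = [0] * n_arms  # number of rewards equal to 1 seen per arm
    failures = [0] * n_arms   # number of rewards equal to 0 seen per arm
    pulls = [0] * n_arms
    best_mean = max(true_probs)
    regret = 0.0

    for _ in range(n_steps):
        # Draw one plausible mean per arm from its Beta(1 + s, 1 + f) posterior
        # (a uniform Beta(1, 1) prior is assumed).
        samples = [
            rng.betavariate(1 + successes[a], 1 + failures[a])
            for a in range(n_arms)
        ]
        # Act greedily with respect to the sampled means.
        arm = max(range(n_arms), key=lambda a: samples[a])

        # Simulate a Bernoulli reward from the chosen arm.
        reward = 1 if rng.random() < true_probs[arm] else 0
        if reward:
            successes[arm] += 1
        else:
            failures[arm] += 1

        pulls[arm] += 1
        regret += best_mean - true_probs[arm]

    return pulls, regret


# Hypothetical three-armed bandit used only for demonstration.
pulls, regret = thompson_sampling([0.2, 0.5, 0.8], n_steps=2000)
print("pulls per arm:", pulls)
print("cumulative regret:", round(regret, 2))
```

Sampling from the posterior rather than using its mean is what drives exploration: arms with few observations have wide posteriors and are occasionally selected, while pulls concentrate on the best arm as evidence accumulates.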

Created: 2020-10-29T21:41:23
Updated: 2021-09-18T23:19:37
Stars: 0
Stars Increase: 0