Home
Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

AI Tutorial

Robust-Asynchronous-Q-Learning-with-Markovian-Data

Public

?????? ?????-?/?????-???/?: The first provably robust variants of asynchronous Q-learning that tolerates adversarially corrupted rewards. Our algorithm is distribution-agnostic, and achieves near-optimal finite-time guarantees up to a provably unavoidable corruption-dependent additive term.

Creat2025-09-04T05:48:02
Update2025-09-12T05:53:15
1
Stars
0
Stars Increase