TREX-Tree-Reward-EXploration
PublicUsing Tree estimators of the MDP models to then count leaves grouping similar transitions and do count-based exploration.
Creat:2024-02-07T01:41:25
Update:2024-02-07T18:37:10
1
Stars
0
Stars Increase
Using Tree estimators of the MDP models to then count leaves grouping similar transitions and do count-based exploration.