TREX-Tree-Reward-EXploration
Uses tree-based estimators of the MDP model to group similar transitions into leaves, then counts visits per leaf to drive count-based exploration.
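
The idea can be illustrated with a minimal sketch. It is not the repository's implementation: every name in it (`fit_tree`, `leaf_id`, `LeafCountBonus`, `beta`) is hypothetical. It assumes the tree regresses the observed reward on (state, action) features, and that the exploration bonus takes the common form beta / sqrt(n + 1), where n is the visit count of the leaf a transition falls into.

```python
import math
from collections import Counter


def _sse(ys):
    """Sum of squared errors around the mean (0.0 for an empty list)."""
    if not ys:
        return 0.0
    mean = sum(ys) / len(ys)
    return sum((y - mean) ** 2 for y in ys)


def fit_tree(X, y, depth=0, max_depth=3, min_leaf=2, _next_id=None):
    """Fit a small regression tree (assumed model: features -> reward).

    Returns a nested dict. Each leaf carries a unique integer id, so
    transitions that land in the same leaf are treated as similar.
    """
    if _next_id is None:
        _next_id = [0]
    best = None  # (cost, feature index, threshold)
    if depth < max_depth and len(X) >= 2 * min_leaf:
        for f in range(len(X[0])):
            for t in sorted(set(row[f] for row in X))[:-1]:
                left = [yi for row, yi in zip(X, y) if row[f] <= t]
                right = [yi for row, yi in zip(X, y) if row[f] > t]
                if len(left) < min_leaf or len(right) < min_leaf:
                    continue
                cost = _sse(left) + _sse(right)
                if best is None or cost < best[0]:
                    best = (cost, f, t)
    # Stop when no admissible split reduces the squared error.
    if best is None or best[0] >= _sse(y):
        leaf = {"leaf": _next_id[0]}
        _next_id[0] += 1
        return leaf
    _, f, t = best
    lo = [(row, yi) for row, yi in zip(X, y) if row[f] <= t]
    hi = [(row, yi) for row, yi in zip(X, y) if row[f] > t]
    return {
        "feature": f,
        "threshold": t,
        "left": fit_tree([r for r, _ in lo], [v for _, v in lo],
                         depth + 1, max_depth, min_leaf, _next_id),
        "right": fit_tree([r for r, _ in hi], [v for _, v in hi],
                          depth + 1, max_depth, min_leaf, _next_id),
    }


def leaf_id(tree, x):
    """Follow the splits and return the id of the leaf containing x."""
    node = tree
    while "leaf" not in node:
        side = "left" if x[node["feature"]] <= node["threshold"] else "right"
        node = node[side]
    return node["leaf"]


class LeafCountBonus:
    """Count visits per leaf and turn the counts into an exploration bonus."""

    def __init__(self, tree, beta=1.0):
        self.tree = tree
        self.beta = beta
        self.counts = Counter()

    def visit(self, x):
        """Record one visit to the leaf that x falls into."""
        self.counts[leaf_id(self.tree, x)] += 1

    def bonus(self, x):
        """Bonus shrinks as the leaf containing x is visited more often."""
        n = self.counts[leaf_id(self.tree, x)]
        return self.beta / math.sqrt(n + 1)
```

In this sketch a transition that has never been seen still inherits the count of its leaf, so the bonus reflects how often similar transitions were visited rather than exact repeats. An agent would add `bonus(x)` to the environment reward when choosing or evaluating actions.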