AIbase
プロダクトライブラリツールナビゲーションMCP

omega

Public

A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environments.

作成時間2021-06-18T05:12:21
更新時間2025-01-30T21:40:43
41
Stars
0
Stars Increase