safe-reward
Publica prototype for an AI safety library that allows an agent to maximize its reward by solving a puzzle in order to prevent the worst-case outcomes of perverse instantiation
Creat:2022-10-14T08:15:20
Update:2022-11-01T09:58:46
8
Stars
0
Stars Increase