HJxB
PublicContinuous-Time/State/Action Fitted Value Iteration via Hamilton-Jacobi-Bellman (HJB)
continuous-controlcontinuous-value-iterationflaxhamilton-jacobihamilton-jacobi-bellmanjaxoptimal-controlreinforcement-learningvalue-iteration
Creat:2021-05-13T01:40:59
Update:2024-06-19T18:46:33
15
Stars
0
Stars Increase