Adan
PublicAdan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
adanartificial-intelligencebert-modelconvnextcuda-programmingdeep-learningdiffusiondreamfusionfairseqgpt2
Creat:2022-09-01T18:34:27
Update:2025-03-26T21:30:23
797
Stars
0
Stars Increase