makeMoE
PublicFrom scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)
Creat:2024-01-23T03:04:58
Update:2025-03-26T12:08:24
735
Stars
2
Stars Increase
From scratch implementation of a sparse mixture of experts language model inspired by Andrej Karpathy's makemore :)