
Mixture-of-MQA

Public

An implementation of a Switch Transformer-like multi-query attention model

Created: 2025-02-20T23:37:37
Updated: 2025-02-26T08:40:50
https://swarms.ai
Stars: 8
Stars increase: 0
