AIbase
Product LibraryTool NavigationMCP

pipegoose

Public

Large scale 4D parallelism pre-training for ? transformers in Mixture of Experts *(still work in progress)*

Creat2023-06-14T14:14:50
Update2024-12-21T01:24:28
84
Stars
1
Stars Increase

Related projects