gpt-neox
PublicAn implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Creat:2020-12-22T22:37:54
Update:2025-03-27T11:51:05
https://www.eleuther.ai/
7.3K
Stars
3
Stars Increase
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries