recurrent-pretraining
PublicPretraining code for a large-scale depth-recurrent language model
Creat:2025-02-08T02:03:44
Update:2025-03-26T03:28:17
https://huggingface.co/tomg-group-umd/huginn-0125
807
Stars
1
Stars Increase
Pretraining code for a large-scale depth-recurrent language model