Diffusion-Language-Model
PublicImplementation of a LLaDA-inspired Masked Diffusion Model for Text using PURE BYTE-LEVEL TOKENIZATION (cuz why not) and Mixed Precision Training for speed.
diffusiondiffusion-langauge-modeldiffusion-llmdiffusion-lmlangauge-modellladallmmachine-learningmasked-diffusionmasked-diffusion-llm
Creat:2025-04-29T04:52:17
Update:2025-05-18T09:57:34
1
Stars
0
Stars Increase