Awesome-LLM-Inference
A curated list of Awesome LLM/VLM Inference Papers with codes: WINT8/4, Flash-Attention, Paged-Attention, Parallelism, etc.
awesome-llm, deepseek, deepseek-r1, deepseek-v3, flash-attention, flash-attention-3, flash-mla, llm-inference, minimax-01, mla
Created: 2023-08-27T10:32:15
Updated: 2025-03-27T11:05:13
Stars: 4.2K
Stars Increase: 6