LLM-Infra
Public?A curated list of Awesome LLM/VLM Inference Papers with Codes: Flash-Attention, Paged-Attention, WINT8/4, Parallelism, etc.?
awesome-llmdeepseekdeepseek-r1deepseek-v3flash-attentionflash-attention-3flash-mlallm-inferenceminimax-01mla
Creat:2023-08-27T10:32:15
Update:2025-08-05T14:22:11
4.3K
Stars
0
Stars Increase