native-sparse-attention-pytorch

Public

Implementation of the sparse attention pattern proposed by the DeepSeek team in their "Native Sparse Attention" paper.

Created: 2025-02-19T11:37:52
Updated: 2025-03-27T05:09:21
Stars: 788
Stars Increase: 1