HomeAI Tutorial

StreamAttn

Public

A high-performance attention mechanism that computes softmax normalization in a single streaming pass using running accumulators (online softmax).

Creat2025-02-10T22:01:30
Update2025-09-04T16:47:32
28
Stars
0
Stars Increase