Infini-Attention
Efficient Infinite Context Transformers with Infini-attention: PyTorch Implementation + QwenMoE Implementation + Training Script + 1M context passkey retrieval
Created: 2024-04-13T09:59:42
Updated: 2025-03-20T13:36:17
https://arxiv.org/abs/2404.07143
Stars: 83
Stars Increase: 0