AIBase
Home
AI NEWS
AI Tools
AI Models
MCP
AI Services
AI Compute
AI Tutorial
Datasets
EN

AI News

View More

NeurIPS 2025 Best Paper Revealed: Tongyi Qianwen from Alibaba Wins China's Only Major Award

Alibaba's 'Attention Gating Makes Better Foundation Models' won Best Paper at NeurIPS 2025. It introduces a 'sliding gate' mechanism to dynamically filter key attention heads and tokens, enabling a 1.7B dense model to match 15B MoE performance. Among 20,000 submissions, it was one of four awards and the sole Chinese winner.....

18.4k 2 minutes ago
NeurIPS 2025 Best Paper Revealed: Tongyi Qianwen from Alibaba Wins China's Only Major Award

Moonshot Introduces a New Hybrid Linear Attention Architecture Kimi Linear

Kimi Linear, a hybrid linear attention architecture by Moon AI, outperforms traditional methods in long/short-range processing and reinforcement learning. It uses Kimi Delta Attention with gating to enhance RNN memory efficiency, combining three KDA and one MLA.....

12.3k 10 hours ago
Moonshot Introduces a New Hybrid Linear Attention Architecture Kimi Linear

Models

View More

SeerAttention QwQ 32B AttnGates

SeerAttention

S

Introducing an attention gating (AttnGates) weight adapter for the QwQ-32B model to accelerate long-context computation through dynamic block-level sparsity

Natural Language ProcessingTransformersTransformers
SeerAttention
35
3
AIBase
Empowering the future, your artificial intelligence solution think tank
English简体中文繁體中文にほんご
FirendLinks:
AI Newsletters AI ToolsMCP ServersAI NewsAIBaseLLM LeaderboardAI Ranking
© 2025AIBase
Business CooperationSite Map