AIBase
Home
AI NEWS
AI Tools
GEO & AEO
MCP
AI Models
EN

AI News

View More

Meituan Launches Native Multimodal LongCat-Next: Visual and Speech Achieve Bottom-Level Unification

Meituan launches LongCat-Next, a native multimodal AI model that uses DiNA technology to unify images, audio, and text into discrete tokens, enabling deep integration of multimodal modeling for enhanced perception of the physical world.....

19.9k 5 minutes ago
Meituan Launches Native Multimodal LongCat-Next: Visual and Speech Achieve Bottom-Level Unification

Models

View More

Dinat Mini In1k 224

shi-labs

D

DiNAT-Mini is a hierarchical vision Transformer model based on neighborhood attention mechanism, specifically designed for image classification tasks.

Computer VisionTransformersTransformers
shi-labs
462
1
AIBase
Empowering the future, your artificial intelligence solution think tank
English简体中文繁體中文にほんご
FirendLinks:
AI Newsletters AI ToolsMCP ServersAI NewsAIBaseLLM LeaderboardAI Ranking
© 2026AIBase
Business CooperationSite Map