MiniMax Launches M3 Large Model: Pioneering the MSA Architecture and Supporting 1M Context, Fully Open-Source to Compete with Overseas Flagships

AIbase基地

Published inAI News · 3 min read · Jun 1, 2026

141

MiniMax Xiyu Technology officially launched its next-generation cutting-edge large model, MiniMax M3, on June 1, 2026. This is the first open-source large model in China that integrates top-tier programming, 1M ultra-long context, and native multi-modal capabilities, aiming to comprehensively match overseas closed-source flagship models.

Regarding the context expansion bottleneck in complex intelligent agent tasks, M3 has developed a sparse attention architecture (MSA) at the underlying level. Compared with traditional solutions, it achieves more accurate KV partitioning and operator-level optimization, with a computing speed four times higher than similar open-source solutions. At a 1M context length, the computational cost per token is only one-tenth of the previous generation model. It achieves over 9 times acceleration in the prefill stage and 15 times acceleration in the decoding stage.

Under the mixed training of native trillion-scale interleaved data, M3's semantic space is highly integrated. It surpasses GPT-5.5 and Gemini 3.1 Pro in authoritative software engineering and multi-modal evaluations such as SWE-Bench Pro. In extreme task tests, M3 demonstrates strong long-thread autonomous planning capabilities. It not only autonomously reproduced the experiments of an ICLR top paper in 12 hours but also ran continuously for 24 hours without reference code, calling tools nearly 2,000 times. It improved the FP8 matrix multiplication hardware utilization on the Hopper architecture from 7.6% to 71.3%, and autonomously scheduled the model to complete the full "data-training-iteration" process on the open PostTrainBench.

MiniMaxM3 Xiyu Technology Open-Source Large Model MSA

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

NVIDIA Releases Nemotron 3 Embed Series 8B Version, Tops RTEB Retrieval Benchmark

NVIDIA released the Nemotron3Embed embedding model series for production-grade RAG, agent retrieval, code search, and memory. The 8B model ranks first on the RTEB benchmark, making it the top open-source embedding model. The series includes three checkpoints: accuracy-focused 8B-BF16, lightweight 1B-BF16, and 1B-NVFP4 4-bit optimized for Blackwell architecture. All models use bidirectional attention.....

Jul 17, 2026

400

AI Daily: Open Source Model Kimi K3 Makes Its Debut; Google Vids Introduces Gemini Omni Model; Zhipu AI Aims for $1 Billion ARR

Welcome to the [AI Daily] segment! Here is your guide to exploring the world of artificial intelligence every day. Every day, we present you with the latest content in the AI field, focusing on developers, helping you grasp technological trends and understand innovative AI product applications. Discover new AI products: https://app.aibase.com/zh1, 2.8 trillion parameters, 1 million token context, KimiK3 pushes the ceiling of open source large models to the highest globally. This article introduces the KimiK3 model released by Moonshot AI.

Jul 17, 2026

470

WeRide Launches Physical AI Large Model WITT

WeRide launches WeRide WITT, a physical AI cognitive foundation model. Its core introduces the 'minimum physical fact unit' concept, enabling AI to better understand multimodal data like video, images, and text, improving cognition for complex autonomous driving scenarios.....

Jul 17, 2026

230

WeRide Releases the Physical AI Perception Foundation Model WIIT, Building a Framework for Understanding the Real World

WeRide unveiled its WIIT large model at WAIC 2026, introducing the 'minimal physical fact unit' concept. It breaks continuous environments into foundational facts, building a framework with fact extraction, reasoning, and verification modules, advancing AI from data understanding to real-world cognition.....

Jul 17, 2026

270

StepZen Releases STEPX Neo Prototype - The World's First Large Model-Native Intelligent Phone

StepFun at WAIC2026 showcased its first large-model native agent phone, STEPX Neo, with orange design. The system includes modules like "Seafood Market" and can integrate third-party agents via partnerships, exploring deep AI agent integration with mobile terminals.....

Jul 17, 2026

280

Shen Dou of Baidu: Each Employee Is Given a Monthly Allowance of 1000 Yuan to Freely Experience Mainstream Large Models - Forcing the Adoption of AI in the Office Is Hard to Yield Results

Baidu's Shen Dou predicts a surge in general and industry-specific AI agent deployments by H2 2026. The next three years are key, with 90% of work deeply augmented by AI, not fully replaced.....

Jul 17, 2026

270

Reduce the First Token Latency by 3.25 Times: Xiaohongshu Collaborates with Peking University and Shanghai Jiao Tong University to Propose HYPIC, Equipping Hybrid Attention Large Models with Location-Independent Caching

The main battlefield of large model services is shifting toward retrieval-augmented question answering, multi-document summarization, and long-range agents. The request prompt is composed of dozens to hundreds of semantically independent segments (retrieved documents, skills explanations, memory, historical rounds), forming ultra-long context with tens of thousands to hundreds of thousands of tokens. The pre-filling stage dominates the computing costs, becoming the most prominent cost source for service providers, and triggering more challenging problems.

Jul 17, 2026

230

Soul Makes Its Debut at WAIC 2026, Unveiling the SoulX Multimodal Interaction Large Model and AI Hardware B Soul

At WAIC 2026, Soul launched B Soul, an AI hardware showcasing real-time multimodal interaction and emotion perception. CTO Tao Ming said the company evolved from a social app into an ecosystem focused on emotion sensing, interaction tech, and self-developed large interaction models, distinct from general-purpose LLMs.....

Jul 17, 2026

330

Zhipu Acquires Zhongke Jihe for Hundreds of Millions of Yuan to Strengthen AI Infra and Domestic Chip Compatibility

Zhipu AI spent hundreds of millions of yuan acquiring AI infra firm Zhongke Jiahe to strengthen domestic computing adaptation and inference optimization. Originating from CAS's ICT, it specializes in compilation tech, with experience in compilers for Loongson and Huawei Ascend. It provides full-chain capabilities from virtual instruction sets to deployment, filling model low-level engineering gaps.....

Jul 17, 2026

230

Roblox Launches AI Creation Tool Build, Supporting Text-Generated Games and 3D Scenes

Roblox launched AI creation tool Build and upgraded its Studio, using AI to lower the development barrier. Users can generate editable game content via text prompts. The feature begins testing on July 28, deepening the 'user-generated content' philosophy. The platform has 132 million daily active users. Build is a mobile-first tool, enabling creation anytime, anywhere.....

Jul 17, 2026

250

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

Website AI Friendliness Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

MiniMax Launches M3 Large Model: Pioneering the MSA Architecture and Supporting 1M Context, Fully Open-Source to Compete with Overseas Flagships

AIbase基地

This article is from AIbase Daily

AI News Recommendations

NVIDIA Releases Nemotron 3 Embed Series 8B Version, Tops RTEB Retrieval Benchmark

AI Daily: Open Source Model Kimi K3 Makes Its Debut; Google Vids Introduces Gemini Omni Model; Zhipu AI Aims for $1 Billion ARR

WeRide Launches Physical AI Large Model WITT

WeRide Releases the Physical AI Perception Foundation Model WIIT, Building a Framework for Understanding the Real World

StepZen Releases STEPX Neo Prototype - The World's First Large Model-Native Intelligent Phone

Shen Dou of Baidu: Each Employee Is Given a Monthly Allowance of 1000 Yuan to Freely Experience Mainstream Large Models - Forcing the Adoption of AI in the Office Is Hard to Yield Results

Reduce the First Token Latency by 3.25 Times: Xiaohongshu Collaborates with Peking University and Shanghai Jiao Tong University to Propose HYPIC, Equipping Hybrid Attention Large Models with Location-Independent Caching

Soul Makes Its Debut at WAIC 2026, Unveiling the SoulX Multimodal Interaction Large Model and AI Hardware B Soul

Zhipu Acquires Zhongke Jihe for Hundreds of Millions of Yuan to Strengthen AI Infra and Domestic Chip Compatibility

Roblox Launches AI Creation Tool Build, Supporting Text-Generated Games and 3D Scenes

AI News Recommendations

NVIDIA Releases Nemotron 3 Embed Series 8B Version, Tops RTEB Retrieval Benchmark

AI Daily: Open Source Model Kimi K3 Makes Its Debut; Google Vids Introduces Gemini Omni Model; Zhipu AI Aims for $1 Billion ARR

WeRide Launches Physical AI Large Model WITT

WeRide Releases the Physical AI Perception Foundation Model WIIT, Building a Framework for Understanding the Real World

StepZen Releases STEPX Neo Prototype - The World's First Large Model-Native Intelligent Phone

Shen Dou of Baidu: Each Employee Is Given a Monthly Allowance of 1000 Yuan to Freely Experience Mainstream Large Models - Forcing the Adoption of AI in the Office Is Hard to Yield Results

Reduce the First Token Latency by 3.25 Times: Xiaohongshu Collaborates with Peking University and Shanghai Jiao Tong University to Propose HYPIC, Equipping Hybrid Attention Large Models with Location-Independent Caching

Soul Makes Its Debut at WAIC 2026, Unveiling the SoulX Multimodal Interaction Large Model and AI Hardware B Soul

Zhipu Acquires Zhongke Jihe for Hundreds of Millions of Yuan to Strengthen AI Infra and Domestic Chip Compatibility

Roblox Launches AI Creation Tool Build, Supporting Text-Generated Games and 3D Scenes