On November 27, the DeepSeek team released DeepSeek-Math-V2 on Hugging Face: a massive 236B-parameter model built on an MoE architecture with only 21B active parameters and a context length extended to 128K tokens. The official weights were released under Apache 2.0 with no commercial restrictions, and download traffic immediately saturated the servers.
Overview of mathematical performance (zero-shot CoT):
- Achieved 75.7% on the MATH benchmark, nearly matching GPT-4o (76.6%);
- Solved 4 of the 30 problems on AIME 2024, more than Gemini 1.5 Pro and Claude-3-Opus;
- Achieved 53.7% on Math Odyssey, also ranking among the top tier.
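For context, "zero-shot CoT" simply means the model is prompted to reason step by step without any few-shot exemplars before the final answer is extracted. A minimal sketch of that evaluation setup, with the prompt wording and answer extraction as illustrative assumptions rather than the benchmarks' actual harness:

```python
import re
from typing import Optional

def zero_shot_cot_prompt(problem: str) -> str:
    # Zero-shot CoT: no worked examples, only an instruction to reason step by step.
    return (
        f"Problem: {problem}\n"
        "Let's think step by step, then end with 'Final answer: <answer>'."
    )

def extract_answer(completion: str) -> Optional[str]:
    # Illustrative extraction; real harnesses also normalize LaTeX, fractions, etc.
    match = re.search(r"Final answer:\s*(.+)", completion)
    return match.group(1).strip() if match else None
```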
The model's core trick is a "self-verification" dual engine: a Generator first produces a draft solution, and a Verifier checks it line by line, sending flagged errors back for rewriting over as many as 16 iterations, with majority voting and meta-verification used to suppress hallucinations. The training corpus reaches 100 billion tokens of papers, competition problems, and synthetic data, and GRPO reinforcement learning is used to align the model with human preferences.
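The exact pipeline isn't published in this post, but the loop described above maps onto something like the sketch below; `generate_fn`, `verify_fn`, and the vote count are placeholders standing in for model calls, not DeepSeek's actual API.

```python
from collections import Counter
from typing import Callable, Optional, Tuple

def solve_with_self_verification(
    problem: str,
    generate_fn: Callable[[str, Optional[str]], str],  # (problem, feedback) -> draft
    verify_fn: Callable[[str], Tuple[bool, str]],      # draft -> (looks_correct, feedback)
    max_iters: int = 16,  # up to 16 generate-verify rounds, per the description above
    num_votes: int = 5,   # verifier samples per round for majority voting (assumed value)
) -> str:
    """Generator drafts a solution, Verifier critiques it; failed drafts are
    regenerated with the critique until accepted or the budget runs out."""
    draft = generate_fn(problem, None)
    for _ in range(max_iters):
        # Sample the verifier several times and majority-vote the verdict
        # to damp spurious accept/reject decisions (hallucination suppression).
        votes = [verify_fn(draft) for _ in range(num_votes)]
        accepted = Counter(ok for ok, _ in votes).most_common(1)[0][0]
        if accepted:
            return draft
        # Feed one failing critique back to the Generator and rewrite.
        feedback = next(fb for ok, fb in votes if not ok)
        draft = generate_fn(problem, feedback)
    return draft  # best effort once the iteration budget is exhausted
```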
Thanks to the mixed code-math corpus, DeepSeek-Math-V2 is also strong at programming: 90.2% on HumanEval, 76.2% on MBPP, and on SWE-bench it is the first open-source model to break the 10% barrier, putting it in direct competition with GPT-4-Turbo and Claude 3 Opus.
The model is now available on Hugging Face and needs only about 80GB of GPU memory for multi-GPU inference; community reproductions are already underway. If you want to give AI a "math gold medal" brain, a few lines of `transformers` loading are all it takes: domestic open source has once again put a crack in the moat of the closed-source giants.
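A minimal loading-and-inference sketch with `transformers`; the repo id and generation settings are assumptions based on DeepSeek's usual Hub naming, so check the model card for the exact path and recommended parameters:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed repo id; verify the exact path on the Hugging Face Hub.
model_id = "deepseek-ai/DeepSeek-Math-V2"

tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,  # bf16 halves memory versus fp32
    device_map="auto",           # shards the MoE weights across available GPUs
    trust_remote_code=True,
)

prompt = "Prove that the sum of two even integers is even. Reason step by step."
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=512)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```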

