Mistral AI has officially launched the second generation of its open-source coding model family: Devstral2 (a 12.3B-parameter flagship) and Devstral Small2 (a 2.4B-parameter lightweight version). The flagship scored 72.2% on the SWE-Bench Verified benchmark, setting a new record for open-source models, and Mistral claims it is seven times more cost-effective than Claude Sonnet. The company also released Mistral Vibe, an open-source CLI tool that supports natural-language batch code modification. Both models are now available via API: Devstral2 is priced at $0.40 per million input tokens, while the lightweight version is completely free.

Model Overview: One Large, One Small, Open-Source Dual Track  


Performance Breakthrough: 72.2% Sets New Record for Open-Source Code Models  

- SWE-Bench Verified: Devstral2 scored 72.2%, surpassing CodeLlama-70B (53.8%) and DeepSeek-Coder-33B (61.4%), and trailing GPT-4-Turbo (73.2%) by only 1pp  

- HumanEval: 84.1% Pass@1, leading other open-source models by 6-8pp  

- Cost: Officially claimed to be seven times cheaper than Claude Sonnet; at approximately $0.40 per million input tokens, it is roughly 1/5 of GPT-4-Turbo's price

Open-Source Tool: Mistral Vibe, Natural-Language Batch Code Modification  

- Features: A single instruction like "convert the function to async" can automatically rewrite an entire repository, supporting diff preview and rollback  

- Engine: Locally calls Devstral Small2 (Apache 2.0), no internet connection required  

- Integration: A VS Code extension is already available, with one-click fixes for ESLint errors and one-click addition of unit tests
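The diff-preview-and-rollback workflow described above can be sketched with Python's standard difflib. The function, variable, and file names here are illustrative assumptions, not taken from Mistral Vibe's actual implementation:

```python
import difflib

def preview_diff(original: str, modified: str, path: str) -> str:
    """Render a unified diff so the user can review a change before applying it."""
    return "".join(difflib.unified_diff(
        original.splitlines(keepends=True),
        modified.splitlines(keepends=True),
        fromfile=f"a/{path}", tofile=f"b/{path}",
    ))

# Hypothetical rewrite: "convert the function to async".
# Keeping the original text around makes rollback a simple restore.
original = "def fetch(url):\n    return get(url)\n"
modified = "async def fetch(url):\n    return await get(url)\n"

diff = preview_diff(original, modified, "client.py")
print(diff)
```

A tool built this way can show `diff` to the user, apply `modified` on confirmation, and restore `original` on rollback.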

Business Strategy: Lightweight Free + Flagship API, Tiered Revenue Generation  

- Devstral Small2: Apache 2.0, commercial use, fine-tuning, and embedding allowed  

- Devstral2: Modified MIT license; organizations with monthly revenue above $20 million must purchase a commercial license or use the official API, preventing large companies from using it for free  

- API Pricing: $0.40 per million input tokens, $1.20 per million output tokens; first 30 days offer 1 million token free quota
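The listed prices make monthly bills easy to estimate. A minimal sketch, using the $0.40/M input and $1.20/M output figures above; the token counts in the example are hypothetical usage, not from the announcement:

```python
# Devstral2 API prices as listed (USD per million tokens).
INPUT_PRICE_PER_M = 0.40
OUTPUT_PRICE_PER_M = 1.20

def monthly_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate a monthly API bill from raw token counts."""
    return (input_tokens / 1_000_000) * INPUT_PRICE_PER_M \
         + (output_tokens / 1_000_000) * OUTPUT_PRICE_PER_M

# Hypothetical workload: 50M input + 10M output tokens per month.
cost = monthly_cost(50_000_000, 10_000_000)
# 50 * 0.40 + 10 * 1.20 = 32.0 USD
```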

Industry Signal: Open-Source Coding Models Enter the '70+ Club'  

- In 2024, mainstream open-source code models on SWE-Bench generally scored between 50-60%; Devstral2 raised the bar to over 72%  

- Low cost and high scores could challenge the cost-effectiveness of paid plugins like GitHub Copilot and Cursor  

- The lightweight version is completely free, potentially accelerating the adoption of "local AI coding assistants," allowing developers to run the 2.4B model on an RTX 4090

Next Steps: 2025 Roadmap  

- Q1: Release a Devstral2-INT4 quantized version that runs on a single A100; launch a Jetson Orin edge-deployment package  

- Q2: Launch 128k context version, supporting the entire codebase plus documentation as prompts  

- Q3: Launch "Vibe Cloud", natural-language code refactoring in the browser, billed per project
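The Q2 item's 128k-token context can be put in rough perspective. Assuming roughly 4 characters per token (a common heuristic, not an official tokenizer figure) and about 60 characters per line of code (also an assumption):

```python
# Back-of-envelope: how much source code fits in a 128k-token prompt.
CONTEXT_TOKENS = 128_000
CHARS_PER_TOKEN = 4    # assumed heuristic, not Mistral's tokenizer spec
CHARS_PER_LINE = 60    # assumed average line length

approx_chars = CONTEXT_TOKENS * CHARS_PER_TOKEN  # ~512,000 characters
approx_lines = approx_chars // CHARS_PER_LINE    # ~8,500 lines of code
```

Under these assumptions, a 128k context holds on the order of 8,000-9,000 lines, enough for a small-to-medium codebase plus its documentation in a single prompt.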

Editor's Conclusion  

Once "code generation" passes the 70-point mark, the deciding factor shifts from "model capability" to "cost and compliance." Devstral2 undercuts the market at $0.40 per million input tokens, while the "modified MIT" license blocks large companies from using the flagship for free; the fully open-sourced lightweight version captures the local-deployment market. For developers, the combination of a free 2.4B model and a low-cost 12.3B flagship means writing code locally and offloading heavy tasks to the cloud, with no further need for a Copilot subscription. AIbase will continue to track the quantized version and the 128k long-context release.