Doubao Large Model 2.0 Officially Released, Inference Cost Reduced by an Order of Magnitude, API Now Open

AIbase基地

Published in AI News · 6 min read · Feb 14, 2026

Volcano Engine has officially launched the Doubao Large Model 2.0 (Doubao-Seed-2.0) series and simultaneously opened API services to enterprises and developers. Individual users can try it through the Volcano Ark Experience Center or the Doubao App's "Expert" mode.

This version has been systematically optimized for large-scale production environments. With efficient inference, multimodal understanding, and complex instruction execution, it is better equipped to handle complex real-world tasks. Its inference cost is about one order of magnitude lower than that of industry-leading models, and daily token usage has grown more than 500-fold since the model's initial release.

Doubao Large Model 2.0 comes in four models tailored to different latency and cost requirements. Pro, the flagship, focuses on complex deep reasoning and Agent tasks. Lite outperforms version 1.8 while consuming fewer tokens, offering strong cost-effectiveness. Mini prioritizes speed and cost, with capabilities comparable to version 1.6 Pro. Code is optimized for developers and real programming environments, and performs even better when used with TRAE.
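As a rough illustration of how an application might choose among the four variants, here is a minimal Python sketch. The positioning text paraphrases the article. The `pick_variant` function, its parameters, and the order of its checks are assumptions made up for illustration and are not part of any Volcano Engine API. The keys are plain labels, not real model identifiers.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Variant:
    label: str  # plain label for illustration, not a real API model ID
    focus: str  # positioning as described in the announcement


# Positioning paraphrased from the article.
VARIANTS = {
    "pro": Variant("Pro", "flagship: complex deep reasoning and Agent tasks"),
    "lite": Variant("Lite", "improved capability with lower token consumption"),
    "mini": Variant("Mini", "prioritizes speed and cost"),
    "code": Variant("Code", "optimized for real programming environments"),
}


def pick_variant(task: str,
                 latency_sensitive: bool = False,
                 needs_deep_reasoning: bool = False) -> Variant:
    """Hypothetical routing rule; the priority of the checks is an assumption."""
    if task == "coding":
        return VARIANTS["code"]
    if needs_deep_reasoning:
        return VARIANTS["pro"]
    if latency_sensitive:
        return VARIANTS["mini"]
    # Default to the cost-effective general-purpose tier.
    return VARIANTS["lite"]


if __name__ == "__main__":
    choice = pick_variant("chat", latency_sensitive=True)
    print(choice.label, "-", choice.focus)
```

In practice the routing criteria would depend on measured latency, pricing, and task quality, none of which the article quantifies per model.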

This update comprehensively upgrades multimodal understanding, with visual understanding reaching a world-class level. The Pro version leads Gemini 3 Pro on evaluations including MMSIBench (spatial understanding), MotionBench (motion understanding), and VideoMME (video understanding), and its chart understanding, measured by CharXiv-RQ, has also improved significantly.

For video scenarios, the model has strengthened its temporal and motion perception. It leads on key evaluations such as TVBench, and its EgoTempo score exceeds human performance. On long-video evaluations it surpasses most top models, enabling real-time video stream analysis, proactive guidance, and other interactions. This suits companion scenarios such as fitness and fashion: the model can accurately infer billiards movements, identify sports actions, and provide professional guidance.

The model's LLM and Agent capabilities have also been significantly enhanced. Added long-tail domain knowledge helps it adapt to professional scenarios: the Pro version scores higher than GPT-5.2 on SuperGPQA, ranks first on HealthBench, and matches Gemini 3 Pro and GPT-5.2 in scientific fields. It leads globally on HLE-text with 54.2 points, surpasses Gemini 3 Pro on the IMO evaluation, and performs strongly in tool calling and instruction following. In some scenarios, its STEM benchmark scores exceed those of Gemini 3 Pro.

The model is also more consistent and controllable in instruction following, and excels at long-chain, multi-step tasks. It can complete continuous workflows such as "find information, summarize, draw conclusions," and can combine tools to carry out end-to-end tasks spanning data processing, content creation, image generation, and layout. Intelligent customer service agents built on it can provide full-cycle service, including customer conversations, issue handoff, and after-sales follow-up. In addition, the Code model can reliably call mainstream IDE tools, has significantly improved front-end capabilities, and supports custom skills. Combined with TRAE, it can greatly improve development efficiency: building a complex web application such as "AI Temple Fair" took only five rounds of prompts, and the related materials have been open-sourced.

To address the surge in token usage in the Agent era, Volcano Engine has also updated its Coding Plan package. Developers can call the model through Volcano Ark. New users can subscribe for as little as 8 yuan for the first month, and the plan is designed to match the appropriate model to different programming tasks.

This article is from AIbase Daily
