Agent Becomes the New Core of AI! Volcano Engine Launches AgentKit, Tan Dai: The Future Computing Unit Will Shift from App to Intelligent Agent

AIbase基地

Published in AI News · 5 min read · Dec 22, 2025

Competition among large models is shifting from "capability contests" to "practical application challenges." At the recent Volcano Engine Force Original Power Conference, Volcano Engine President Tan Dai outlined a new paradigm in AI evolution: intelligent agents will become the core carrier for AI implementation, and multi-modal capabilities together with an efficient agent development system are the key to bridging the gap between technology and industry.

From "chatting" to "working": Large models enter a phase of tackling complex scenarios

Tan Dai pointed out that large models used to be applied mainly to question-and-answer interactions. They have now reached high-complexity industries such as automotive, manufacturing, and catering. In these scenarios, AI needs to process text instructions, visual input, sensor data, and tool outputs at the same time. In a factory, for example, an agent might identify an equipment abnormality and open a ticket in the maintenance system; in a restaurant, it might generate nutritional analysis and recommendations from images of dishes. This requires models to have human-like multi-modal understanding and the ability to operate in their environment, rather than relying solely on pre-defined APIs.

Agent development becomes the biggest bottleneck, Volcano Engine launches AgentKit to break through

"The model's capabilities are already strong enough, but how to package them into stable and scalable agents remains a major industry bottleneck," Tan Dai admitted. To address this, Volcano Engine officially launched AgentKit—a smart agent development and operation framework derived from internal practices, providing full-chain components such as task planning, tool calling, memory management, secure sandboxing, and monitoring and tracking, significantly lowering the development and maintenance costs of agents.

Agents will become the "new computing unit" in the AI era

Tan Dai further predicted that the core infrastructure of the AI era will shift from web pages and mobile apps to intelligent agents. This means cloud architecture must be restructured: databases need to support agent state persistence, computing resources need to be scheduled dynamically according to tasks, and networks must ensure low-latency communication for multi-agent collaboration. "An agent is not a functional module, but a digital employee with goals, memory, and the ability to act," he said.
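As one illustration of what "agent state persistence" could mean in practice, the sketch below stores an agent's goal and progress as JSON in a SQLite table so that a restarted process can pick up where it left off. The table layout and the function names (open_store, save_state, load_state) are assumptions made for this example, not a description of any Volcano Engine service.

```python
import json
import sqlite3
from typing import Optional


def open_store(path: str = ":memory:") -> sqlite3.Connection:
    """Open (or create) a small key-value store for agent state."""
    conn = sqlite3.connect(path)
    conn.execute(
        "CREATE TABLE IF NOT EXISTS agent_state ("
        "agent_id TEXT PRIMARY KEY, state TEXT NOT NULL)"
    )
    return conn


def save_state(conn: sqlite3.Connection, agent_id: str, state: dict) -> None:
    """Insert or overwrite the serialized state for one agent."""
    conn.execute(
        "INSERT OR REPLACE INTO agent_state (agent_id, state) VALUES (?, ?)",
        (agent_id, json.dumps(state)),
    )
    conn.commit()


def load_state(conn: sqlite3.Connection, agent_id: str) -> Optional[dict]:
    """Return the stored state, or None if the agent has never been saved."""
    row = conn.execute(
        "SELECT state FROM agent_state WHERE agent_id = ?", (agent_id,)
    ).fetchone()
    return json.loads(row[0]) if row else None


conn = open_store()
save_state(conn, "agent-1", {"goal": "restock shelf", "step": 1})
save_state(conn, "agent-1", {"goal": "restock shelf", "step": 2})
print(load_state(conn, "agent-1"))
```

An in-memory database is used here only to keep the example self-contained; passing a file path to open_store would make the state survive a restart.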

Safety must be inherently embedded in Agent design

Addressing the risks of AI abuse, Tan Dai emphasized that traditional perimeter protection has failed and that safety capabilities must be built into the entire lifecycle of agent operations. Volcano Engine has integrated mechanisms such as input filtering, output compliance verification, approval for sensitive operations, and behavior auditing into AgentKit, with the aim of ensuring that agents operate reliably in open environments.
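The four mechanisms named above can be pictured as checkpoints wrapped around every agent action. The following is a minimal sketch under stated assumptions: the class name GuardedAgent, the term lists, and the approval hook are all hypothetical and chosen for illustration, and real filters would be far more sophisticated than substring matching. It does not represent how AgentKit implements these checks.

```python
from dataclasses import dataclass, field
from typing import Callable, List

# Illustrative policy data only; real deployments would use proper classifiers and policies.
BLOCKED_INPUT_TERMS = ("ignore previous instructions",)
SENSITIVE_ACTIONS = {"delete_records", "transfer_funds"}
FORBIDDEN_OUTPUT_TERMS = ("password",)


@dataclass
class GuardedAgent:
    act: Callable[[str, str], str]      # (action, request) -> output
    approve: Callable[[str], bool]      # human or policy approval hook
    audit_log: List[str] = field(default_factory=list)

    def handle(self, action: str, request: str) -> str:
        self.audit_log.append(f"request:{action}")            # behavior auditing
        if any(t in request.lower() for t in BLOCKED_INPUT_TERMS):
            self.audit_log.append("blocked:input")            # input filtering
            return "rejected: input filter"
        if action in SENSITIVE_ACTIONS and not self.approve(action):
            self.audit_log.append("blocked:approval")         # approval for sensitive operations
            return "rejected: approval required"
        output = self.act(action, request)
        if any(t in output.lower() for t in FORBIDDEN_OUTPUT_TERMS):
            self.audit_log.append("blocked:output")           # output compliance verification
            return "rejected: output check"
        self.audit_log.append("ok")
        return output


guarded = GuardedAgent(act=lambda a, r: f"done {a}", approve=lambda a: False)
first = guarded.handle("lookup_order", "status of order 17")
second = guarded.handle("delete_records", "remove old logs")
print(first, "|", second)
print(guarded.audit_log)
```

The point of the design is that every request leaves an audit entry whether or not it is allowed, which is what makes after-the-fact review possible.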

AIbase believes that Volcano Engine's latest release marks the transition of domestic large model vendors from "model suppliers" to "builders of intelligent agent operating systems." When AI no longer just answers questions but actively performs tasks, industrial intelligence truly begins. The open-source nature and cloud-native integration of AgentKit may become a key accelerator for Chinese companies embracing the "Agent economy."

Amazon SageMaker has deployed the Voxtral model from Mistral AI

Mistral AI has launched the Voxtral series of models, which integrate text and audio processing. The series includes two models: Voxtral-Mini-3B-2507 and Voxtral-Small-24B-2507. The former is a 3-billion-parameter model suited to fast audio transcription and basic multimodal understanding; the latter has 24 billion parameters and supports advanced audio-text intelligence and multilingual processing for enterprise applications. Both models support audio context processing of 30 to 40 minutes.

Zhipu AI Launches GLM-4.7, a New Generation Open-Source Coding Large Model with Significant Performance Improvement

On December 22, Zhipu Huazhang released and open-sourced its new-generation large model GLM-4.7. The model focuses mainly on programming scenarios and is reported to perform strongly on multiple international benchmarks, especially in coding, with comprehensive performance said to surpass GPT-5.2. It ranked first among both open-source and domestic models on the coding evaluation platform Code Arena.

Apple Collaborates with Purdue University to Develop DarkDiff Technology: Capturing Night Vision Quality Photos Even in Extremely Low Light

Apple and Purdue University have developed DarkDiff, a technology that improves smartphone photography in extremely low light by integrating a generative diffusion model into the camera's image processing pipeline. It processes raw image data directly, addressing the blurred detail and artificial look that traditional night-mode noise reduction tends to produce, and enables clear details to be captured in the dark.

AI Daily: Qwen releases hierarchical image editing model Qwen-Image-Layered; Kling2.6 adds voice and action control features; Google launches A2UI open standard

Welcome to the [AI Daily] column, your daily guide to the world of artificial intelligence. Each day we present the latest developments in AI with a focus on developers, helping you follow technology trends and learn about innovative AI product applications. Discover new AI products: https://app.aibase.com/zh

1. Tongyi Qianwen by Alibaba releases the hierarchical image editing model Qwen-Image-Layered, generating Photoshop layers with one click.

Tongyi Qianwen Launches Qwen-Image-Layered Model for Image Layered Editing Breakthrough

Tongyi Qianwen launches the image generation model Qwen-Image-Layered, innovatively adopting the 'layer decomposition' technology to achieve precise editing of static images. This model uses the 'image disentanglement' approach to automatically split images into layers, effectively solving two major challenges in traditional AI editing: global modifications that disrupt consistency and local edits that struggle with occlusion and blurred boundaries, ushering in a new era of 'edit wherever you point.'

AI Auto-Operations Engineer Resolve AI Secures A-Round Funding Led by Lightspeed

AI operations startup Resolve AI has completed its Series A funding at a pre-money valuation of $1 billion, making it a new unicorn. The round was led by Lightspeed Venture Partners and used a multi-tier pricing structure. The company was founded by former Splunk employees and focuses on automated operations, specifically site reliability engineering (SRE). Its rapid growth reflects the capital market's strong interest in the AI enterprise services sector.

This article is from AIbase Daily

AI News Recommendations

Amazon SageMaker has deployed the Voxtral model from Mistral AI

UBTech Subsidiary Youqi Joins Forces with Volcano Engine, Doubao Large Model Empowers New Field of Embodied Intelligence

U.S. Department of War Collaborates with xAI: Grok Model Will Be Integrated into Military-Grade AI Platform GenAI.mil

Alphabet Invests $4.75 Billion to Acquire Intersect, Strengthening the Green Energy Engine for AI Computing Power

Zhipu AI Launches GLM-4.7, a New Generation Open-Source Coding Large Model with Significant Performance Improvement

Apple Collaborates with Purdue University to Develop DarkDiff Technology: Capturing Night Vision Quality Photos Even in Extremely Low Light

AI Daily: Qwen releases hierarchical image editing model Qwen-Image-Layered; Kling2.6 adds voice and action control features; Google launches A2UI open standard

Tongyi Qianwen Launches Qwen-Image-Layered Model for Image Layered Editing Breakthrough

Beijing Humanoid Robot Launches the First VLA Large Model XR-1 in Accordance with National Standards

AI Auto-Operations Engineer Resolve AI Secures A-Round Funding Led by Lightspeed
