Baidu has taken another important step in artificial intelligence with the official release of its latest multimodal reasoning model, ERNIE-4.5-VL-28B-A3B-Thinking. The new model combines strong language processing with an innovative "image thinking" capability, marking a significant improvement in how it understands and reasons over images.

According to Baidu's introduction, ERNIE-4.5-VL-28B-A3B-Thinking activates only about 3B of its 28B total parameters per token, giving it strong computing efficiency and flexibility. This design allows the model to respond quickly and remain efficient across a wide range of tasks.

More notably, Baidu has equipped this model with an "image thinking" feature: during its reasoning, ERNIE-4.5-VL can zoom in on regions of an image and invoke tools such as image search. This capability enriches how users can interact across images and text, opening new possibilities for applications such as intelligent search, online education, and e-commerce.
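To make the idea concrete, the snippet below is a minimal sketch of a client-side loop that services an image-zoom request during generation. The `<tool_call>` marker, the JSON schema, and the `model_generate` callable are all hypothetical stand-ins for illustration only; they are not Baidu's actual tool-call protocol.

```python
# Minimal sketch of a client-side loop that services an "image zoom" tool call.
# The <tool_call> marker, JSON schema, and `model_generate` callable are all
# hypothetical stand-ins, not Baidu's actual tool-call protocol.
import json
from PIL import Image


def run_with_zoom_tool(prompt: str, image: Image.Image, model_generate) -> str:
    """Let the model request enlarged crops of the image while it reasons."""
    images = [image]
    response = model_generate(prompt, images)
    while response.strip().startswith("<tool_call>"):
        payload = response.strip().removeprefix("<tool_call>").removesuffix("</tool_call>")
        call = json.loads(payload)
        if call.get("name") == "zoom_in":
            left, top, right, bottom = call["arguments"]["bbox"]   # pixel box (assumed format)
            images.append(image.crop((left, top, right, bottom)))  # hand the enlarged crop back
        response = model_generate(prompt, images)  # model continues with the extra view
    return response
```

The point of the sketch is simply that "image thinking" turns the image into something the model can actively inspect step by step, rather than a single static input.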

Against the backdrop of rapid progress in AI, Baidu continues to demonstrate its position in multimodal AI with ERNIE-4.5-VL. By releasing the model as open source, Baidu makes it easier for developers and researchers to explore the potential of multimodal AI and to advance related technologies and applications.
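For developers who want to try the open-source weights, the following sketch shows how a multimodal checkpoint like this is typically loaded with the Hugging Face transformers library. The repository id `baidu/ERNIE-4.5-VL-28B-A3B-Thinking`, the auto classes, and the chat-template call are assumptions based on common vision-language-model conventions, not confirmed loading instructions; consult the official model card for the exact usage.

```python
# Minimal sketch of loading the open-source checkpoint with Hugging Face
# transformers. The repository id, auto classes, and chat-template call are
# assumptions based on common VLM conventions; check the official model card.
import torch
from PIL import Image
from transformers import AutoModelForCausalLM, AutoProcessor

MODEL_ID = "baidu/ERNIE-4.5-VL-28B-A3B-Thinking"  # assumed repository name

processor = AutoProcessor.from_pretrained(MODEL_ID, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    MODEL_ID,
    torch_dtype=torch.bfloat16,  # bf16 keeps the 28B MoE weights manageable
    device_map="auto",
    trust_remote_code=True,
)

messages = [{"role": "user", "content": [
    {"type": "image", "image": Image.open("chart.png")},
    {"type": "text", "text": "What trend does this chart show?"},
]}]

inputs = processor.apply_chat_template(
    messages, add_generation_prompt=True, tokenize=True,
    return_dict=True, return_tensors="pt",
).to(model.device)

with torch.no_grad():
    output = model.generate(**inputs, max_new_tokens=512)

print(processor.decode(output[0][inputs["input_ids"].shape[-1]:], skip_special_tokens=True))
```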

The release of ERNIE-4.5-VL-28B-A3B-Thinking is not only an important technological step for Baidu; it also marks a new chapter in multimodal artificial intelligence. We look forward to seeing this technology play a greater role across industries, helping people process information and solve problems more intelligently.