Baidu Open-Sources the WENXIN (ERNIE) 4.5 Large Model Series, Sparking a New Wave in the Domestic Large Model Market!

AIbase基地

Published in AI News · 5 min read · Jun 30, 2025

Baidu recently announced that it has open-sourced its ERNIE Bot 4.5 series, releasing ten models in total: mixture-of-experts (MoE) models with 47B and 3B activated parameters, plus a dense model with 0.3B parameters. The release includes the full pre-trained weights as well as inference code, marking a significant step forward in Baidu's large model efforts.
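
The term "activated parameters" reflects how MoE models work in general: a gating function sends each token to only a few experts, so only part of the network runs per token. The following is a minimal, generic sketch of top-k routing in plain Python. It is not Baidu's implementation; the function names, the toy experts, and the gate scores are all hypothetical and chosen only for illustration.

```python
import math


def softmax(scores):
    """Convert raw gate scores into probabilities (numerically stable)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(gate_scores, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts.

    Weights are renormalized so that they sum to 1 over the chosen experts.
    """
    probs = softmax(gate_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, experts, gate_scores, k):
    """Run only the selected experts and mix their outputs by gate weight."""
    return sum(w * experts[i](x) for i, w in route_top_k(gate_scores, k))


# Hypothetical toy setup: four "experts" that each scale their input.
experts = [lambda x, s=s: s * x for s in (1.0, 2.0, 3.0, 4.0)]
gate_scores = [0.1, 2.0, 0.3, 1.5]  # illustrative values, not learned

routes = route_top_k(gate_scores, k=2)
output = moe_forward(1.0, experts, gate_scores, k=2)
print("selected experts:", [i for i, _ in routes])
print("output:", output)
```

With k=2 out of four experts, only half of the expert parameters take part in processing this input, which is the sense in which a model's activated parameter count can be much smaller than its total size.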

The newly released models can be downloaded and deployed from platforms such as the PaddlePaddle Starry Community and Hugging Face, and Baidu Intelligent Cloud's Qianfan Large Model Platform offers corresponding API services. With this move, Baidu joins Tencent, Alibaba, and ByteDance as another major Chinese tech company actively participating in open source, signaling its commitment in the era of large model applications.

In February of this year, Baidu previewed the launch plan for the ERNIE Bot 4.5 series and announced that it would be open-sourced by June 30. Although the upgraded ERNIE Bot 4.5 Turbo is not part of this open-source release, the announcement has still sparked discussion among developers. Many of them believe that the smaller models are well suited to memory-constrained configurations while still performing well, which could make them competitive with other large models such as DeepSeek V3 and Alibaba's Qwen.
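
To make the memory-constrained point concrete, here is a rough back-of-the-envelope sketch of how much memory model weights alone would need at common precisions. The helper name and the byte-per-parameter table are illustrative assumptions, and the estimate ignores activations, the KV cache, and runtime overhead, so real usage will be higher.

```python
# Approximate storage cost per parameter for common numeric formats.
BYTES_PER_PARAM = {"fp32": 4.0, "fp16": 2.0, "int8": 1.0, "int4": 0.5}


def weight_memory_gib(num_params, dtype):
    """Estimate memory for the weights only, in GiB (1 GiB = 2**30 bytes)."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


# The 0.3B dense model mentioned in the article, at several precisions.
for dtype in ("fp32", "fp16", "int8", "int4"):
    print(f"0.3B params @ {dtype}: {weight_memory_gib(0.3e9, dtype):.2f} GiB")
```

Note that for MoE models the weight footprint depends on the total parameter count rather than the activated count, because every expert normally has to be loaded even if only a few run per token.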

The ERNIE Bot 4.5 series is a family of natively multi-modal foundation models. In multiple tests, it has outperformed competitors such as GPT-4o. Beyond understanding text, the models can process visual information such as photos and videos, demonstrating strong capabilities in multi-modal understanding and generation.

The ERNIE Bot 4.5 series builds on three key technical innovations: first, multi-modal heterogeneous MoE pre-training, which enables the model to capture information from both text and visual modalities effectively; second, efficient infrastructure for fast training and inference; and third, modality-specific post-training, which helps the model perform better across diverse practical applications.

As global competition in the large model market intensifies, Baidu's open-source initiative puts pressure on closed-source model providers and raises the industry's overall technical bar. It also gives developers and researchers more freedom, helping them iterate on and apply models more quickly and advancing the development of artificial intelligence.

This article is from AIbase Daily
