Baidu Releases Global Leading Document Parsing Model PaddleOCR-VL, Reshaping the OCR Technology Landscape!

AIbase基地

Published inAI News · 4 min read · Oct 17, 2025

Recently, Baidu officially released and open-sourced its self-developed multimodal document parsing model PaddleOCR-VL. This model ranked first in the world for comprehensive performance on the authoritative document parsing evaluation list OmniBenchDoc V1.5 with an impressive score of 92.6, demonstrating excellent performance in four core capabilities: text, tables, formulas, and reading order.

PaddleOCR-VL has a core model parameter count of only 0.9B, making it lightweight and efficient. It can accurately identify complex elements such as text, handwritten Chinese characters, tables, formulas, and charts with minimal computational cost. The model supports 109 languages, including Chinese, English, French, Japanese, Russian, Arabic, and Spanish, and is suitable for various intelligent document processing tasks such as government and enterprise document management, knowledge retrieval, archive digitization, and research information extraction.

As a derivative model of Wenxin 4.5, PaddleOCR-VL-0.9B successfully achieved breakthroughs in both accuracy and efficiency by integrating the NaViT dynamic resolution visual encoder with the ERNIE-4.5-0.3B language model. Specifically, the model performed exceptionally well on OmniDocBench v1.5, with a text edit distance of 0.035, a CDM of 91.43 for formula recognition, a TEDS of 93.52 for tables, and a reading order prediction error value of 0.043. These data demonstrate its stability and reliability in high-difficulty scenarios such as complex document, handwritten manuscript, and historical archive recognition.

In terms of inference speed, PaddleOCR-VL can process 1881 Tokens per second on a single A100 GPU, showing significant improvements compared to other mainstream models. It is 14.2% faster than MinerU2.5 and 253.01% faster than dots.ocr. This performance has set a new benchmark in OCR technology.

Different from traditional OCR technology, PaddleOCR-VL can understand complex layout structures like humans, accurately extract diverse information such as financial tables, mathematical formulas, and class notes, and automatically restore the order that conforms to human reading habits, ensuring the accuracy of information delivery and the clarity of logic. Its innovative two-stage architecture first detects the layout and predicts the reading order, and then identifies and structurally outputs elements such as text, tables, and formulas, which significantly improves the stability and efficiency of recognition.

AI Chip Shortage: Samsung's Operating Profit Rises 200% in Q4 2025, Setting a New Historical High

Samsung Electronics posted strong 2025 results, driven by surging chip demand from the global AI race. Full-year operating profit rose 33.2% to KRW 43.6 trillion, with sales up 10.9% to KRW 333.6 trillion and net profit increasing 31.2% to KRW 45.2 trillion. Q4 performance was particularly strong, with operating profit hitting a record high.....

Baidu Intelligent Cloud Accelerates: AI Revenue Growth Target Doubles to 200%

Baidu Intelligent Cloud announced at an internal strategy meeting that it has significantly increased its AI-related revenue growth target for 2026 from 100% to 200%, demonstrating an aggressive approach to compete for the leading position in the AI cloud market. Its confidence comes from an optimistic forecast of market potential, with IDC predicting that the global AI cloud market size is expected to exceed $400 billion by 2030.

Baidu Intelligent Cloud Significantly Raises AI Revenue Expectations: Growth Target Doubles

Baidu Intelligent Cloud has significantly increased its AI-related revenue growth target for 2026 from 100% to 200%, aiming to consolidate and expand its leading position in the AI cloud market and strive for the number one spot in the industry. Data from 2025 shows that Baidu has won 109 large model-related projects among the main cloud vendors in China, demonstrating strong market performance.

Baidu Wenyin APP Launches Industry's First Multi-Person Multi-Agent Group Chat Beta Test

Baidu's Wenxin APP launched a new beta test on January 27, introducing the industry's first 'multi-user, multi-Agent' AI group chat feature. This innovation breaks the traditional one-on-one interaction model, allowing multiple specialized AI agents (e.g., group chat assistants, health managers) to coexist and collaborate in a single chat, forming a multi-dimensional AI think tank. The AI assistants not only deeply understand context but also pos....

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Brand Visibility

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Services​

AI Model Compatibility Checker

AI Deployment Calculator

Baidu Releases Global Leading Document Parsing Model PaddleOCR-VL, Reshaping the OCR Technology Landscape!

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Kimi publicly called out the wrong person: The top four results from Baidu search are not official websites. After the response, the issue was quickly removed

The New Infrastructure of a Century-Old Giant: Dacheng Construction Relies on ChatGPT to Revitalize Talent Vitality

Ant Group Launches LingBot-VLA: Dual-Arm Robot Control Enters the Era of Large Models

Altman's Start of the Year Sales: The Main Model is the Thin iPhone Air

AI Chip Shortage: Samsung's Operating Profit Rises 200% in Q4 2025, Setting a New Historical High

Baidu Intelligent Cloud Accelerates: AI Revenue Growth Target Doubles to 200%

Baidu Intelligent Cloud Significantly Raises AI Revenue Expectations: Growth Target Doubles

Efficiency First! Sam Altman Says AI Has Helped OpenAI Significantly Slow Down Hiring

Baidu Wenyin APP Launches Industry's First Multi-Person Multi-Agent Group Chat Beta Test

Baidu Intelligent Cloud Soars: AI Revenue Target Doubles by 2026, Aims for Top Industry Position

AI News Recommendations

Kimi publicly called out the wrong person: The top four results from Baidu search are not official websites. After the response, the issue was quickly removed

The New Infrastructure of a Century-Old Giant: Dacheng Construction Relies on ChatGPT to Revitalize Talent Vitality

Ant Group Launches LingBot-VLA: Dual-Arm Robot Control Enters the Era of Large Models

Altman's Start of the Year Sales: The Main Model is the Thin iPhone Air

AI Chip Shortage: Samsung's Operating Profit Rises 200% in Q4 2025, Setting a New Historical High

Baidu Intelligent Cloud Accelerates: AI Revenue Growth Target Doubles to 200%

Baidu Intelligent Cloud Significantly Raises AI Revenue Expectations: Growth Target Doubles

Efficiency First! Sam Altman Says AI Has Helped OpenAI Significantly Slow Down Hiring

Baidu Wenyin APP Launches Industry's First Multi-Person Multi-Agent Group Chat Beta Test

Baidu Intelligent Cloud Soars: AI Revenue Target Doubles by 2026, Aims for Top Industry Position

GEO Services