SuperCLUE Multimodal Vision August Evaluation Ranking: Gemini-2.5-Pro Ranks First

AIbase基地

Published in AI News · 2 min read · Aug 29, 2025

In SuperCLUE-VLM, the Chinese benchmark for evaluating multimodal vision-language models, whose latest results were released on August 28, Gemini-2.5-Pro ranked first with a total score of 74.99 and OpenAI's GPT-5 (high) ranked second with 68.59.

The benchmark is built around three core dimensions: basic cognition, visual reasoning, and visual application. It is tailored to Chinese-language scenarios and aims to provide an objective and fair evaluation standard for the development of multimodal vision-language models.

The evaluation covered 15 multimodal models spanning mainstream Chinese and international offerings, including Claude-Opus-4.1, Gemini-2.5-Pro, GPT-5 (high), ERNIE-4.5-Turbo-VL, Doubao-Seed-1.6-thinking, hunyuan-t1-vision, and Qwen-VL-Max-Latest.

In the final ranking, the 6.40-point gap between Gemini-2.5-Pro (74.99) and GPT-5 (high) (68.59) separated the top two places. Baidu's ERNIE-4.5-Turbo-VL tied with other Chinese models, showing strong market competitiveness.

SuperCLUE-VLM Gemini-2.5-Pro GPT-5 Multimodal Vision Language Model

This article is from AIbase Daily
