Latest AI News

Tracking Global AI Breakthroughs and Industry Transformation

AI Daily Brief

AI insights in 3 minutes daily

Information

AI Product Finder

Curated AI Open Source Solutions for Enterprise Intelligence

AI Product Rankings

Authoritative AI tools ranking, one-stop selection

AI Product Submit

Submit AI products, build intelligent ecosystem together

Tools

AI Tools Directory

Discover The Best AI Websites & Tools

Building and Deploying AI

Deploy 100+ open-source software on a dedicated instance in <3 mins

Information

AI Models Finder

Open Source Pre-trained Models for Faster AI Deployment

LLM Leaderboard

Comparison and ranking the performance of over 100 AI models

Model Providers

Connect with Top LLM Providers Worldwide

Submit Your Model

Submitting your AI Model, monetize value quickly

Tools

Compare LLMs

Compare LLM Capabilities, Choose Models Effortlessly

LLM Cost Calculator

Calculate LLM Costs Instantly, Stay Within Budget

LLM Arena

AI Performance Showdown: Battle-Tested, Best-in-Class

Information

MCP Servers

Best mcp servers powering enterprise development and deployment

MCP Client

Multi-model orchestration, complex business simplified

MCP Case Tutorials

Step-by-step guide to master core development and practical skills

MCP Ranking

Explore the most popular MCP servers ranked

MCP Service Submission

Submit MCP services, monetize value quickly

Tools

MCP Playground

Connect AI to Tools Instantly: Your Zero-Barrier MCP Playground

MCP Inspector

One-Click Integration: Seamlessly Bridge AI and Tools

Kunlun Wildfire Launches Skywork-R1V 3.0: Cross-modal Reasoning Capabilities Approaching Those of Human Experts!

AIbase基地

Published inAI News · 3 min read · Jul 9, 2025

Recently, Kuaizhi Wanyi officially released its brand-new open-source model Skywork-R1V3.0, claiming to have reached an unprecedented level in multimodal reasoning, even comparable to the level of human junior experts. During the training process, the model adopted a reinforcement learning strategy, achieving significant progress in complex logical modeling and cross-disciplinary knowledge generalization.

Skywork-R1V3.0 was "bootstrapped" based on the previous generation Skywork-R1V2.0, using high-quality distilled data and rejection sampling techniques to successfully build a powerful multimodal reasoning training set. The design of this model is not limited to text, but also includes image processing, significantly improving its ability to reason between images and text.

According to the introduction, the training of Skywork-R1V3.0 relies on only about 12,000 supervised fine-tuning samples and 13,000 reinforcement learning samples, demonstrating the unique advantage of "small data triggering great capability." In the authoritative comprehensive multimodal evaluation MMMU, Skywork-R1V3.0 scored 76.0, leading over closed-source models such as Claude-3.7-Sonnet (75.0) and GPT-4.5 (74.4), proving its outstanding cross-modal understanding ability.

In specific application scenarios, Skywork-R1V3.0 has shown excellent performance in multiple fields such as physics, logic, and mathematical reasoning. For example, in the physics reasoning evaluation, the model achieved the best open-source scores of 52.8 and 31.5, showing its ability to understand complex physics problems. Additionally, in the logic reasoning test, Skywork-R1V3.0 also achieved an excellent score of 59.7.

The model is also formidable in mathematical reasoning, achieving excellent scores of 77.1, 59.6, and 52.6 in evaluations such as MathVista, MathVerse, and MathVision, significantly outperforming other open-source models. These outstanding performances make Skywork-R1V3.0 a strong competitor in the current open-source multimodal reasoning field.

The release of Skywork-R1V3.0 marks a new peak in multimodal reasoning technology. Its powerful performance and open-source nature will greatly promote the further development of AI technology.

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team