As AI technology advances, how to give large models "parallel thinking" capabilities has become a hot topic among researchers. Recently, Tencent AI Lab, in collaboration with research teams from multiple universities, introduced a new reinforcement learning (RL) framework called Parallel-R1, designed to teach large models to explore multiple reasoning paths simultaneously. The framework opens a new avenue for tackling complex mathematical reasoning tasks.
Traditional approaches rely on supervised fine-tuning (SFT), which not only demands high-quality data but also tends to make models merely imitate existing traces, limiting autonomous learning and generalization. Parallel-R1 addresses this with a key observation from the research team: simple prompts are enough to elicit high-quality parallel-thinking data from the model on easy math problems. A "progressive curriculum" then trains the model in stages: it first learns the syntax and format of parallel thinking on simple tasks, then gradually transitions to reinforcement learning on more complex math problems.
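The staged schedule described above can be sketched as a simple step router. Note that the phase names, the step threshold, and the routing function itself are illustrative assumptions for this sketch, not values or APIs from the paper.

```python
def curriculum_stage(step: int, sft_steps: int = 500) -> str:
    """Route a training step to a curriculum phase (illustrative sketch).

    Early steps do cold-start SFT on parallel-thinking traces collected
    from easy problems (learning the format); later steps switch to RL
    on harder math problems. The threshold of 500 is an assumption.
    """
    return "sft_easy_parallel" if step < sft_steps else "rl_hard_math"


# Example: the schedule switches phases once, at the threshold.
phases = [curriculum_stage(s) for s in (0, 499, 500, 2000)]
```

In this sketch the curriculum is a hard switch; a real schedule could also mix the two phases or ramp the problem difficulty gradually.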
To handle reward design, the team also proposed an alternating reward strategy that balances problem-solving accuracy against thinking diversity. During training the model primarily receives accuracy rewards, but at intervals it also earns an additional bonus for using parallel thinking. This strategy markedly increases the model's use of parallel thinking and yields gains across multiple mathematical benchmarks.
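One way to read "alternating" is a periodic schedule in which most steps reward accuracy alone and designated steps add a parallel-thinking bonus. The following is a minimal sketch under that assumption; the cycle length, bonus magnitude, and how parallel thinking is detected are all hypothetical, not taken from the paper.

```python
def alternating_reward(step: int, is_correct: bool, uses_parallel: bool,
                       cycle: int = 5, bonus: float = 0.2) -> float:
    """Sketch of an alternating reward schedule (assumed design).

    Accuracy is always rewarded; on every `cycle`-th step, a response
    that uses parallel thinking earns an extra bonus. `cycle` and
    `bonus` are illustrative values.
    """
    reward = 1.0 if is_correct else 0.0
    if uses_parallel and step % cycle == 0:
        reward += bonus
    return reward
```

The periodic bonus nudges the policy toward parallel thinking without letting the diversity signal dominate the accuracy signal, which is the balance the strategy aims for.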
Experimental results show that Parallel-R1 raises average accuracy by up to 8.4% across multiple math benchmarks and delivers a 42.9% performance jump on AIME25. The researchers also observed that, over the course of training, the model's strategy shifts from broad exploration early on to precise verification later, demonstrating the advantages of parallel thinking.
The success of Parallel-R1 not only opens a new direction for large-model reasoning but also offers fresh insights for future AI research, highlighting the potential of parallel thinking for solving complex tasks.