AIBase
Home
AI NEWS
AI Tools
GEO & AEO
MCP
AI Models
EN

AI News

View More

Alibaba Cloud Launches New Mathematical Reasoning Model Qwen2.5-Math-PRM, 7B Version Surpasses GPT-4o

Today, the Alibaba Cloud Tongyi team officially released the new mathematical reasoning process reward model Qwen2.5-Math-PRM. This model offers two sizes, 72B and 7B, with performance significantly outperforming similar open-source process reward models, especially excelling in identifying reasoning errors. The 7B version of Qwen2.5-Math-PRM astonishingly surpasses the widely popular GPT-4o, marking an important step in Alibaba Cloud's research and development of reasoning models.

15.8k 3 days ago
Alibaba Cloud Launches New Mathematical Reasoning Model Qwen2.5-Math-PRM, 7B Version Surpasses GPT-4o

Alibaba Qwen Team Releases New Process Reward Model, Advancing Mathematical Reasoning

The Alibaba Qwen team recently published a paper titled 'Lessons Learned from the Development of Process Reward Models in Mathematical Reasoning' and introduced two new models in the Qwen2.5-Math-PRM series, featuring 7B and 72B parameters respectively. These models break through the limitations of the existing PRM framework in mathematical reasoning, significantly improving the accuracy and generalization ability of reasoning models through innovative techniques. Mathematical reasoning has long been a major challenge for large language models (LLMs), especially regarding errors in intermediate reasoning steps.

17.2k 10 hours ago
Alibaba Qwen Team Releases New Process Reward Model, Advancing Mathematical Reasoning
AIBase
Empowering the future, your artificial intelligence solution think tank
English简体中文繁體中文にほんご
FirendLinks:
AI Newsletters AI ToolsMCP ServersAI NewsAI MarketingLLM LeaderboardAI Ranking
© 2026AIBase
Business CooperationSite Map