Alibaba Cloud Open-Sources Qwen2.5-1M: Model with 1-Million-Token Context Length Debuts

AIbase

Published in AI News · 3 min read · Jan 27, 2025


Following the release of DeepSeek R1, Alibaba Cloud's Tongyi Qianwen team has announced its latest open-source model, Qwen2.5-1M, once again drawing industry attention.

The newly released Qwen2.5-1M series includes two open-source models: Qwen2.5-7B-Instruct-1M and Qwen2.5-14B-Instruct-1M. This is the first time Tongyi Qianwen has released models that natively support a context length of one million tokens, and the release also brings a significant improvement in inference speed.


The core highlight of Qwen2.5-1M is its native support for contexts of up to one million tokens. This lets the model handle extremely long inputs such as books, lengthy reports, and legal documents in a single pass, without cumbersome segmentation. The model also supports longer and deeper conversations: it can retain a longer dialogue history and so provide a more coherent and natural interaction. In addition, Qwen2.5-1M shows stronger capabilities on complex tasks such as code comprehension, intricate reasoning, and multi-turn dialogue.
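As a rough illustration of what "without cumbersome segmentation" means in practice, the sketch below checks how many separate requests a document would need under a given context window. The 4-characters-per-token ratio, the 32,000-token comparison window, the reserved output budget, and the `estimate_tokens` and `plan_request` helpers are all assumptions made for illustration and are not part of Qwen2.5-1M; a real application would count tokens with the model's own tokenizer.

```python
import math

# Assumed values, for illustration only.
CHARS_PER_TOKEN = 4          # crude heuristic, not the Qwen tokenizer
CONTEXT_1M = 1_000_000       # context length described in the article


def estimate_tokens(text: str) -> int:
    """Rough token estimate from character count (hypothetical heuristic)."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def plan_request(text: str, context_limit: int, reserved_for_output: int = 1_000) -> int:
    """Return how many separate requests are needed to cover the whole text."""
    budget = context_limit - reserved_for_output
    if budget <= 0:
        raise ValueError("context_limit must exceed reserved_for_output")
    tokens = estimate_tokens(text)
    return max(1, math.ceil(tokens / budget))


book = "x" * 2_000_000       # stand-in for a document of roughly 500,000 tokens

print(plan_request(book, context_limit=32_000))      # 17 chunks under the shorter window
print(plan_request(book, context_limit=CONTEXT_1M))  # 1 request, no segmentation
```

Under these assumptions the same document needs 17 chunks with the shorter window but fits in a single request at one million tokens, which is the difference the article describes.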

Beyond the million-token context length, Qwen2.5-1M brings a second major advance: a much faster inference framework. The Tongyi Qianwen team has fully open-sourced an inference framework that is based on vLLM and integrates a sparse attention mechanism. With this framework, Qwen2.5-1M processes million-token inputs 3 to 7 times faster, which makes ultra-long-context models considerably more practical to use in real applications.
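To give a sense of why sparse attention helps at this scale, the sketch below counts the query-key pairs scored by full causal attention and by a simple sliding-window pattern. The sliding window is a generic stand-in chosen for illustration; the article does not describe the specific sparse pattern used in the Qwen2.5-1M framework, and the window size is an assumed value. Pair counts also ignore every other cost of inference, so the ratio printed here is not a prediction of the 3 to 7 times speedup.

```python
def full_causal_pairs(n: int) -> int:
    """Query-key pairs scored when each token attends to itself and all earlier tokens."""
    return n * (n + 1) // 2


def sliding_window_pairs(n: int, window: int) -> int:
    """Pairs scored when each token attends only to the last `window` tokens (itself included)."""
    if window <= 0:
        raise ValueError("window must be positive")
    w = min(window, n)
    # The first w tokens see 1, 2, ..., w keys; the remaining n - w tokens see w keys each.
    return w * (w + 1) // 2 + (n - w) * w


n = 1_000_000      # million-token input, as in the article
window = 8_192     # assumed window size, for illustration only

full = full_causal_pairs(n)
sparse = sliding_window_pairs(n, window)
print(f"full causal:    {full:,} pairs")
print(f"sliding window: {sparse:,} pairs")
print(f"reduction:      {full / sparse:.1f}x fewer pairs")
```

Full attention grows quadratically with input length while the windowed count grows roughly linearly, which is the general reason sparse patterns become attractive for very long inputs.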


This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present the hot topics in the AI field with a focus on developers, helping you understand technical trends and learn about innovative AI product applications.

Created by the AIbase Daily Team

