Following DeepSeek R1, the Alibaba Cloud Tongyi Qianwen team has just announced the launch of its latest open-source model.
The newly released Qwen2.5-1M series includes two open-source models:
Welcome to the AI Daily column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present hot topics in the AI field with a focus on developers, helping you understand technical trends and learn about innovative AI product applications.
Stanford University recently released a comprehensive evaluation of AI models for clinical medicine. DeepSeek R1 came out on top among nine leading large models, achieving a 66% win rate and a macro average score of 0.75. What sets this evaluation apart is that it goes beyond traditional medical licensing exam questions and examines the day-to-day work of clinicians, giving a more practical assessment. The evaluation team developed an integrated assessment framework called MedHELM, which includes 35 benchmarks covering 22 subtasks in medicine.
Artificial Analysis, an independent company that analyzes AI models and APIs, released its latest evaluation report on DeepSeek R1-0528. The results show a significant gain in technical performance: DeepSeek has surpassed xAI, Meta, and Anthropic and now stands level with Google as the world's second-ranked artificial intelligence lab. DeepSeek has also established itself as the undisputed leader among open-weights models.
Tencent Yuanbao has officially announced integration of the latest DeepSeek R1-0528, with early access on both its desktop and web versions. The model update brings three significant changes aimed at a more efficient, higher-quality AI experience. In reasoning, the new model is stronger: it can handle complex coding problems and analyze a wide range of questions in depth, quickly and accurately, giving users reliable solutions. This improvement makes Tencent Yuanbao more practical for work and study.
The DeepSeek R1T Chimera model, developed by TNG Technology Consulting, has officially launched on the OpenRouter platform, giving developers worldwide access to efficient and powerful inference. This new open-source model combines the strong reasoning capabilities of DeepSeek R1 with the high performance of V3-0324, marking another significant step in balancing performance and efficiency in open-source AI. The following is compiled by AIbase.
Artificial intelligence (AI) models are advancing rapidly, yet despite continuous improvements by developers, questions remain about how well these models truly perform. To address this, the Vector Institute, co-founded by Geoffrey Hinton, has released a research study, "Assessing the State of the Art," which provides a comprehensive evaluation of 11 leading open-source and closed-source models through an interactive leaderboard. The evaluation covers mathematics, general knowledge, and coding.
Welcome to the AI Daily column, your daily guide to exploring the world of artificial intelligence. We present the hottest topics in the AI field with a focus on developers, helping you understand technology trends and innovative AI product applications. Discover new AI products here: https://top.aibase.com/
1. Qwen3 is coming soon: support for Alibaba Cloud's new model has been officially merged into the vLLM code repository. The upcoming release of Qwen3 marks another significant advancement in Alibaba Cloud's AI efforts.