Following the DeepSeek R1, the Alibaba Cloud Tongyi Qianwen team has just announced the launch of its latest open-source model
The newly released Qwen2.5-1M series includes two open-source models:
Following the DeepSeek R1, the Alibaba Cloud Tongyi Qianwen team has just announced the launch of its latest open-source model
The newly released Qwen2.5-1M series includes two open-source models:
Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.
Developed by TNG Technology Consulting, the DeepSeek R1T Chimera model has officially launched on the OpenRouter platform, providing global developers with efficient and powerful inference capabilities. This new open-source model combines the excellent inference capabilities of DeepSeek R1 with the high performance of V3-0324, marking another significant breakthrough in the balance of performance and efficiency in open-source AI technology. The following is compiled by AIbase.
The rapid advancement of Artificial Intelligence (AI) models has led to concerns about the true performance of these models, despite continuous improvements by developers. To address this, the Vector Institute, founded by Geoffrey Hinton, has released a research study, "Assessing the State of the Art," which provides a comprehensive evaluation of 11 leading open-source and closed-source models through an interactive leaderboard. The evaluation covers mathematics, general knowledge, and coding.
Welcome to the AI Daily column! Your daily guide to exploring the world of artificial intelligence. We present you with the hottest topics in the AI field, focusing on developers and helping you understand technology trends and innovative AI product applications. Discover new AI products here: https://top.aibase.com/1、Qwen3 is coming soon: Support for Alibaba Cloud's new model has been officially merged into the vLLM code repository. Alibaba Cloud's Qwen3 model is about to be released, marking another significant advancement in its AI endeavors.
Kuaishou announced that its search function has fully integrated the DeepSeek R1 large model, aiming to significantly improve search results and user experience, and further drive user activity growth. Simultaneously, Kuaishou is also looking to the future, actively exploring the commercialization potential of intelligent search scenarios. Previously, Kuaishou's AI content creation platform, "Keling AI", has already integrated DeepSeek R1. In the video and image generation fields, users can now leverage DeepSeek's powerful capabilities to generate or optimize prompts more efficiently.
Reka AI, founded by a dozen former Google DeepMind scientists, has unveiled its first open-source model: Reka Flash 3. This 21-billion parameter inference model has garnered significant attention. Despite its relatively smaller parameter count, Reka Flash 3 is a general-purpose reasoning model trained from scratch. It underwent supervised fine-tuning on synthetic and public datasets and further refinement through model-based techniques.
Nvidia has introduced Dynamo, a new software designed to significantly accelerate the performance of its DeepSeek AI technology, promising a 30-fold speed increase.