Tencent Hunyuan TurboS Technical Report Details 560B-Parameter Hybrid Mamba Architecture

AIbase

Published in AI News · 1 min read · May 22, 2025

Tencent has released its Hybrid Transformer-Mamba technical report, setting out the core innovations and capabilities of its flagship large language model, TurboS.

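According to the latest ranking from Chatbot Arena, a widely recognized evaluation platform for large models, Hunyuan TurboS placed 7th among 239 participating models. That makes it the top Chinese model after Deepseek, and internationally it trails only a handful of companies such as Google, OpenAI, and xAI.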

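Hunyuan TurboS adopts an innovative Hybrid Transformer-Mamba architecture. The design combines Mamba's efficiency on long sequences with the Transformer's strength in contextual understanding to balance performance and efficiency. The model has 128 layers in total and more than 56 billion activated parameters, and it is described as the industry's first large-scale deployment of a Transformer-Mamba mixture-of-experts (MoE) model. With this architecture, TurboS achieved an overall score of 1356 in an authoritative international evaluation.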
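The headline quotes 560B parameters while the body quotes just over 56 billion activated parameters. In a mixture-of-experts model only a subset of the weights is used per token, so both figures can hold at once. The short sketch below works out the implied ratio. Reading the headline's 560B as the total parameter count is an assumption made here for illustration, and the function name is hypothetical.

```python
# Figures quoted in the article. Treating 560B as the *total* count is an
# assumption for this sketch; the article only labels the 56B figure as
# "activated".
TOTAL_PARAMS = 560e9       # headline figure (assumed to be total parameters)
ACTIVATED_PARAMS = 56e9    # "more than 56 billion activated parameters"


def activated_fraction(activated: float, total: float) -> float:
    """Return the share of parameters that are active for a single token."""
    if total <= 0:
        raise ValueError("total must be positive")
    if activated < 0 or activated > total:
        raise ValueError("activated must be between 0 and total")
    return activated / total


if __name__ == "__main__":
    fraction = activated_fraction(ACTIVATED_PARAMS, TOTAL_PARAMS)
    print(f"Roughly {fraction:.0%} of the parameters are active per token")
```

Under that assumption, about one tenth of the weights participate in each forward pass, which is where the efficiency claim for the MoE design comes from.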

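To further improve capability, Hunyuan TurboS introduces an adaptive long-short chain-of-thought (CoT) mechanism that automatically switches response modes according to the complexity of the question. The model can answer simple questions quickly and analyze complex ones in depth to give more accurate answers. The team also designed a post-training process made up of four key modules, including supervised fine-tuning and adaptive long-short CoT fusion, to strengthen performance further.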
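The idea of switching between a short and a long reasoning mode can be illustrated with a toy router. This is only a sketch: in TurboS the switching is presumably learned during post-training, whereas the keyword list, weights, threshold, and function names below are all hypothetical stand-ins chosen to make the control flow visible.

```python
# Toy illustration of "short vs. long chain-of-thought" routing.
# Everything here (hints, weights, threshold) is a hypothetical heuristic,
# not the mechanism described in the technical report.
HARD_HINTS = ("prove", "derive", "step by step", "optimize", "debug")


def estimate_complexity(prompt: str) -> float:
    """Score a prompt from 0.0 (trivial) to 1.0 (hard) with crude signals."""
    text = prompt.lower()
    # Longer prompts count as harder, saturating at 50 words.
    length_score = min(len(text.split()) / 50.0, 1.0)
    # Reasoning-style keywords count as harder, saturating at two hits.
    hint_hits = sum(1 for hint in HARD_HINTS if hint in text)
    hint_score = min(hint_hits / 2.0, 1.0)
    return 0.4 * length_score + 0.6 * hint_score


def choose_mode(prompt: str, threshold: float = 0.3) -> str:
    """Return "long" for prompts scored at or above the threshold, else "short"."""
    return "long" if estimate_complexity(prompt) >= threshold else "short"


if __name__ == "__main__":
    for question in (
        "What is the capital of France?",
        "Prove that the sum of two even numbers is even, step by step.",
    ):
        print(choose_mode(question), "<-", question)
```

A real system would replace `estimate_complexity` with a signal from the model itself; the point of the sketch is only that one entry point dispatches to a fast path or a deliberate path.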

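In the pre-training stage, Hunyuan TurboS was trained on a corpus of 16 trillion tokens, selected for data quality and diversity. Its core architecture comprises Transformer, Mamba2, and feed-forward network (FFN) components, with the layers arranged to maximize training and inference efficiency.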

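The technical report not only demonstrates Tencent's technical strength in large language models but also offers new ideas and directions for the future development of large models.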

Paper link: https://arxiv.org/abs/2505.15431

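Key points:

🌟 TurboS ranked 7th on Chatbot Arena, showing strong competitiveness.

💡 The Hybrid Transformer-Mamba architecture balances performance and efficiency.

🔍 The adaptive long-short chain-of-thought mechanism improves the model's ability to handle questions of varying complexity.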

This article is from AIbase Daily

