Global AI Model Speed Record Broken! Zhipu Releases GLM-5.1 High-Speed Version

AIbase

Published in AI News · 3 min read · May 22, 2026

Chinese artificial intelligence company Zhipu officially announced today the launch of the new GLM-5.1 Highspeed API for selected enterprise customers. The model, named "GLM-5.1-highspeed," has drawn wide industry attention since its release for its output speed of 400 tokens/s.
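To put the reported figure in perspective, the short sketch below converts an output speed into the time needed to stream a response of a given length. It is illustrative arithmetic only: the response lengths are hypothetical examples, the function name is made up for this sketch, and the calculation assumes a constant output speed and ignores time to first token and network overhead.

```python
# Illustrative arithmetic: streaming time at a constant output speed.
# Only the 400 tokens/s figure comes from the article; everything else
# here is a hypothetical example.
OUTPUT_SPEED_TOKENS_PER_S = 400


def generation_seconds(num_tokens: int, tokens_per_second: float) -> float:
    """Return the time needed to emit num_tokens at a constant output speed."""
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return num_tokens / tokens_per_second


if __name__ == "__main__":
    # Hypothetical response lengths, not figures published by Zhipu.
    for num_tokens in (200, 1000, 4000):
        seconds = generation_seconds(num_tokens, OUTPUT_SPEED_TOKENS_PER_S)
        print(f"{num_tokens} tokens -> {seconds:.2f} s")
```

Under these assumptions, a 1,000-token answer would stream in about 2.5 seconds.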

This figure sets a new record for output speed among large model API providers worldwide. Until now, the AI industry has generally treated model speed and model size as a trade-off, with higher speed usually coming at the cost of model capability.

Breaking Industry Conventions with Flagship Performance

The GLM-5.1 Highspeed version breaks the industry convention that "fast means small." For the first time among Chinese large models, it brings flagship-level capabilities and very low latency together in real production environments.

The model was reportedly developed jointly by Zhipu's GLM team and the TileRT team. The two teams carried out system-level optimizations at three levels, namely the inference engine, the scheduling system, and the underlying infrastructure, and abandoned traditional dynamic scheduling.

Optimization at Three Levels Ensures Stable Output

On the technical side, the development team rewrote the core inference path for the model architecture to improve single-card throughput, and reduced latency in high-concurrency scenarios through techniques such as dynamic batching. Coordinated optimization of the infrastructure, meanwhile, made 400 tokens/s a stable, production-grade capability.
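The article does not describe how batching is implemented in this system. As a rough illustration of the general idea behind dynamic batching, the sketch below groups incoming requests into batches that are closed either when they are full or when the oldest request has waited too long. All names (`Request`, `form_batches`, `max_batch_size`, `max_wait_ms`) are hypothetical and chosen for this example; this is not Zhipu's or TileRT's code.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Request:
    request_id: int
    arrival_ms: float  # time at which the request reached the server


def form_batches(
    requests: List[Request], max_batch_size: int, max_wait_ms: float
) -> List[List[Request]]:
    """Group requests into batches in arrival order.

    The current batch is closed when it already holds max_batch_size
    requests, or when the next request arrives more than max_wait_ms
    after the first request in that batch.
    """
    batches: List[List[Request]] = []
    current: List[Request] = []
    for req in sorted(requests, key=lambda r: r.arrival_ms):
        batch_full = len(current) >= max_batch_size
        waited_too_long = (
            bool(current) and req.arrival_ms - current[0].arrival_ms > max_wait_ms
        )
        if current and (batch_full or waited_too_long):
            batches.append(current)
            current = []
        current.append(req)
    if current:
        batches.append(current)
    return batches


if __name__ == "__main__":
    # Hypothetical arrival times in milliseconds.
    arrivals = [0, 1, 2, 10, 11, 30]
    reqs = [Request(i, t) for i, t in enumerate(arrivals)]
    for batch in form_batches(reqs, max_batch_size=2, max_wait_ms=5):
        print([r.request_id for r in batch])
```

The two parameters capture the usual tension: a larger batch improves hardware utilization, while a shorter wait limit keeps individual requests from being held back under light load.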

The high-speed model has broad application prospects and is especially suited to scenarios with strict response-latency requirements, such as AI programming, real-time voice interaction, and high-frequency business decisions. It is already available on Zhipu's MaaS platform for selected enterprises.


This article is from AIbase Daily



