400 Tokens/s Sets a Global Record! Zhipu and TileRT Jointly Launch the GLM-5.1 High-Speed API
Zhipu AI has released a high-speed GLM-5.1 API with an output rate of 400 tokens/s, setting a global speed record. The release breaks the usual trade-off between performance and latency: a domestic Chinese large model now delivers flagship capabilities at ultra-low latency, so users no longer have to compromise between speed and quality.