MiTA AI Search Launches New Ultra-Fast Model: Up to 400 tokens/second response speed

AIbase

Published in AI News · 2 min read · May 27, 2025

The "Ultra-fast" model of MiTA AI Search has officially launched, giving users a faster and more precise search experience.

The MiTA AI Search team reports a response speed of up to 400 tokens/second on a single H800 GPU, achieved by applying kernel fusion on the GPU and dynamic compilation optimization on the CPU. Most questions can be answered within 2 seconds.
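As a rough illustration of what these figures imply, the arithmetic can be sketched in Python. This is a back-of-the-envelope calculation, not the team's benchmark method: it assumes a constant decoding rate at the reported 400 tokens/second peak and ignores retrieval time and time to first token. The function names and the 300-token answer length are hypothetical.

```python
# Back-of-the-envelope arithmetic for the reported figures.
# Assumption: decoding runs at a constant rate equal to the reported peak;
# search/retrieval latency and time to first token are ignored.

PEAK_TOKENS_PER_SECOND = 400.0  # peak rate reported in the article
ANSWER_BUDGET_SECONDS = 2.0     # "most questions ... within 2 seconds"


def tokens_generated(tokens_per_second: float, seconds: float) -> float:
    """Upper bound on tokens produced in `seconds` at a constant rate."""
    return tokens_per_second * seconds


def seconds_to_generate(num_tokens: int, tokens_per_second: float) -> float:
    """Time needed to produce `num_tokens` at a constant rate."""
    return num_tokens / tokens_per_second


# At the peak rate, a 2-second budget covers at most 800 tokens.
max_tokens_in_budget = tokens_generated(PEAK_TOKENS_PER_SECOND, ANSWER_BUDGET_SECONDS)

# A hypothetical 300-token answer would take 0.75 seconds to generate.
time_for_300_tokens = seconds_to_generate(300, PEAK_TOKENS_PER_SECOND)

print(max_tokens_in_budget)  # 800.0
print(time_for_300_tokens)   # 0.75
```

Under these assumptions, any answer of up to 800 tokens fits inside the 2-second window, which is consistent with the claim that most questions are answered within that time.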

To let users experience the new model's speed firsthand, MiTA AI Search has set up a speed-test site (kuai.metaso.cn), where users can enter questions at any time and see how quickly the model responds.

With its speed, accuracy, and logic, the "Ultra-fast" model is expected to spur a new round of technical innovation in the AI search field and to give users higher-quality search services.

This article is from AIbase Daily
