MegaT model launched on PyPI Achieving 400 tokens/second ultra-fast response

AIbase基地

Published inAI News · 4 min read · May 27, 2025

Miota AI Search has launched a brand-new "Speed" model, marking a significant breakthrough in its artificial intelligence search technology. Through innovative technical means, the response speed of Miota AI Search has reached an astonishing 400 tokens/second, ensuring that most questions can be answered within 2 seconds. This advancement not only enhances user experience but also significantly improves the efficiency of information retrieval.

The realization of this "Speed" model is due to the application of multiple advanced technologies. The Miota AI team optimized kernel fusion on GPUs and implemented dynamic compilation optimization on CPUs. These combined technologies maximize the performance of a single H800 GPU. Users can clearly feel that the model not only responds faster but also demonstrates a significant improvement in answer accuracy and clearer logical structure.

To allow users to intuitively experience this technological innovation, Miota AI Search also provides a speed testing site where users can freely input questions and experience the charm of rapid responses. This speed testing site is open for only one week, attracting numerous users to try it out. On this platform, users can see the real-time response process and experience the convenience brought by AI search.

In the tests, Miota AI Search randomly selected two questions for answers. The first question was about why "tear-off sheets" suddenly became popular, and the "Speed" mode quickly provided an answer, showcasing the model's fast response capability. The second question focused on the research progress of CRISPR-Cas9 in treating genetic diseases, using the "Speed-Thinking" mode for detailed answers, demonstrating the model's clarity in handling complex issues.

The Miota AI Search team stated that they will continue to focus on technological innovation and further enhance the intelligence level and user experience of AI. Users can look forward to more features being released and more efficient search experiences in the future.

Key Points:

🌟 The newly launched "Speed" model responds at a rate of up to 400 tokens/second, ensuring that most questions are answered within 2 seconds.

⚙️ Through GPU kernel fusion and CPU dynamic compilation optimization, the model's accuracy and logical clarity have been improved.

🚀 Users can experience the quick response of AI search through the speed testing site, with random test questions showcasing the technical advantages.

MegaT AI new word Ultra-fast model GPU optimization

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

xAI Launches New CLI Tool Grok Build to Help Developers Code More Efficiently!

xAI recently released the early version of the Agentic command line tool "Grok Build", designed specifically for developers to simplify coding, building applications, and automating workflows. It is currently available only to SuperGrok Heavy subscribers and can be accessed via x.ai/cli. The tool is positioned as a smart development assistant, offering more advanced features than traditional command lines.

May 15, 2026

190

Qwen APP Deeply Integrates with the National Medical Products Administration Data, Launching Millions of Authoritative Information on Medicines and Medical Devices

Qwen APP has reached a strategic cooperation with the National Medical Products Administration Information Center, fully integrating millions of national-level authoritative data on medicines, cosmetics, and medical devices. This move aims to address the 'hallucination' issue in AI health consultations by real-time verification against authoritative databases, enhancing the accuracy of information and providing precise medication guidance and ingredient analysis for tens of millions of users, marking a crucial step in the compliance and professional development of domestic large models in vertical fields.

May 15, 2026

150

WeChat Releases Youth AI Insight Report: Token Consumption Exceeds 50 Billion, Generative AI Becomes Standard in Teaching

According to the official report "Global Youth AI+Mini Program Insight Report" released by WeChat, the results of the educational platform's open four years are remarkable: the annual token consumption in AI creation by teachers and students has exceeded 50 billion, equivalent to 3.75 million deep conversations. The platform has attracted nearly 80,000 students and 17,000 teachers globally, with more than 280,000 mini program projects created in total, marking that generative AI has been deeply integrated into youth programming education.

May 15, 2026

170

Dongguan Announces: One Out of Every Two AI Glasses in the World is Made in Dongguan!

Dongguan released the "Global Smart Manufacturing Center Construction Plan," stating that one out of every two AI glasses in the world is produced in Dongguan, highlighting its manufacturing advantages in the wearable devices sector. The AI glasses industry has become a landmark field for cultivating new productive forces in Dongguan. Local company Qianhuanshiheng has spent a decade developing smart glasses from concept to mass consumer adoption, reinforcing Dongguan's position as a core hub in the global supply chain.

May 15, 2026

170

Core Talent is Accelerating Its Loss, SpaceXAI Faces R&D Challenges under Elon Musk's New Team

After the merger of SpaceX and xAI into SpaceXAI, over 50 top researchers and engineers have left since February this year, affecting core teams such as programming assistants, world models, and Grok voice interaction. The pre-training team has been severely poached by competitors, raising concerns about the sustainability of the technology.

May 15, 2026

110

AI Daily: WeChat Mini Program Officially Integrates Hy3 Preview; QQ Browser Launches Gaokao AI Skill; Moonlight Releases Kimi WebBridge

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present the latest content in the AI field, focusing on developers to help you understand technical trends and innovative AI product applications. Click to learn more about new AI products: https://app.aibase.com/zh1. WeChat announced that the Mini Program Growth Plan has officially integrated the Hy3preview model, enhancing the AI capabilities to improve development.

May 15, 2026

180

Lucyd Launches AI Real-Time Translation Calls: Native Language Communication Like a Walkie-Talkie, Smart Glasses Manufacturers Compete for New Opportunities in the Wearable AI Platform Market

Innovative Eyewear released a major update to its Lucyd app, adding AI real-time call translation. Users with compatible smart glasses can converse in their native language globally, receiving natural-sounding, consistent-voice translations. This shift marks smart glasses' evolution from hardware competition to voice AI platform ecosystems, breaking language barriers and enabling cross-border collaboration.....

May 15, 2026

160

AI Coding Startup Cursor Plans to Hire 200 People in Asia-Pacific, Previously Received Significant Investment from SpaceX

AI coding startup Cursor launches global expansion, plans to hire 200 employees in Asia-Pacific within six months, including marketing, on-site, and AI deployment engineers. The company has established an office in Singapore, led by Simon Green, with subsequent hiring covering Japan, Sydney, Melbourne, and India, accelerating technological globalization.

May 15, 2026

150

Alibaba Cloud AI Webtoon Solution: The Short Drama Industry Enters a New Intelligent Era!

On May 14, Alibaba Cloud held an 'AI Innovation Day' event in Zhengzhou High-tech Zone, unveiling an intelligent solution for short comic and drama creation. Centered on 'model + platform + tools + ecosystem,' it aims to advance AI-driven short comics from generation to large-scale production. According to Alibaba Cloud Chief Architect Li Jin, the short drama industry is rapidly growing, with China's animation market expected to see significant d....

May 15, 2026

160

Alibaba Cloud Launches Qoder 1.0: Evolving from AI IDE to an Autonomous Agent Development Workspace

Alibaba Cloud launches Qoder 1.0, achieving a strategic upgrade from AI IDE to an "Autonomous Agent Development Workspace." Its core is the Agent-first work paradigm, allowing users to simply define requirements, after which the Agent team can autonomously complete the entire process from execution, validation to delivery. A highlight is the Quest independent window, integrating task management, status tracking, and artifact review functions, freeing developers from engineering details.

May 15, 2026

170

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

AI Conversation Insight

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

MegaT model launched on PyPI Achieving 400 tokens/second ultra-fast response

AIbase基地

This article is from AIbase Daily

AI News Recommendations

xAI Launches New CLI Tool Grok Build to Help Developers Code More Efficiently!

Qwen APP Deeply Integrates with the National Medical Products Administration Data, Launching Millions of Authoritative Information on Medicines and Medical Devices

WeChat Releases Youth AI Insight Report: Token Consumption Exceeds 50 Billion, Generative AI Becomes Standard in Teaching

Dongguan Announces: One Out of Every Two AI Glasses in the World is Made in Dongguan!

Core Talent is Accelerating Its Loss, SpaceXAI Faces R&D Challenges under Elon Musk's New Team

AI Daily: WeChat Mini Program Officially Integrates Hy3 Preview; QQ Browser Launches Gaokao AI Skill; Moonlight Releases Kimi WebBridge

Lucyd Launches AI Real-Time Translation Calls: Native Language Communication Like a Walkie-Talkie, Smart Glasses Manufacturers Compete for New Opportunities in the Wearable AI Platform Market

AI Coding Startup Cursor Plans to Hire 200 People in Asia-Pacific, Previously Received Significant Investment from SpaceX

Alibaba Cloud AI Webtoon Solution: The Short Drama Industry Enters a New Intelligent Era!

Alibaba Cloud Launches Qoder 1.0: Evolving from AI IDE to an Autonomous Agent Development Workspace

AI News Recommendations

xAI Launches New CLI Tool Grok Build to Help Developers Code More Efficiently!

Qwen APP Deeply Integrates with the National Medical Products Administration Data, Launching Millions of Authoritative Information on Medicines and Medical Devices

WeChat Releases Youth AI Insight Report: Token Consumption Exceeds 50 Billion, Generative AI Becomes Standard in Teaching

Dongguan Announces: One Out of Every Two AI Glasses in the World is Made in Dongguan!

Core Talent is Accelerating Its Loss, SpaceXAI Faces R&D Challenges under Elon Musk's New Team

AI Daily: WeChat Mini Program Officially Integrates Hy3 Preview; QQ Browser Launches Gaokao AI Skill; Moonlight Releases Kimi WebBridge

Lucyd Launches AI Real-Time Translation Calls: Native Language Communication Like a Walkie-Talkie, Smart Glasses Manufacturers Compete for New Opportunities in the Wearable AI Platform Market

AI Coding Startup Cursor Plans to Hire 200 People in Asia-Pacific, Previously Received Significant Investment from SpaceX

Alibaba Cloud AI Webtoon Solution: The Short Drama Industry Enters a New Intelligent Era!

Alibaba Cloud Launches Qoder 1.0: Evolving from AI IDE to an Autonomous Agent Development Workspace