Google has once again redefined the performance-cost frontier for large models. Today the company officially released its new lightweight model, Gemini 3 Flash, which not only responds three times faster than its predecessor, achieving near "zero latency," but also surpasses the current flagship Gemini 3 Pro on multiple high-difficulty benchmarks, making it the first "Flash" model to beat its bigger sibling in a same-generation comparison. More surprisingly, this top-tier model is free worldwide and is integrated by default into the Gemini App, AI Studio, Google Antigravity, and the CLI tools.

Gemini 3 Flash's groundbreaking benchmark results show a lightweight model punching far above its weight:

- On SWE-bench, the benchmark for resolving real-world code issues, it scores 78%, slightly ahead of Gemini 3 Pro (76.2%);

- On GPQA Diamond, the PhD-level science reasoning test, it scores 90.4%;

- On Humanity’s Last Exam, an extremely difficult comprehensive assessment (no-tools mode), it scores 33.7%, well ahead of the previous flagship, Gemini 2.5 Pro;

- It ranks third globally on the LMArena text leaderboard.


This leap in performance comes from deep optimization of the model architecture: while keeping inference costs extremely low, Google applies techniques such as knowledge distillation, inference-path compression, and multimodal alignment, giving the small model a logical depth close to that of much larger ones. When a user uploads an image or video, Flash can analyze the content and produce an actionable plan within seconds, whether that means identifying a circuit fault or planning a travel route.
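For developers, the same multimodal path is reachable through the API exposed in AI Studio. Below is a minimal sketch using the google-genai Python SDK; the model identifier "gemini-3-flash" and the image filename are illustrative assumptions, not confirmed values.

```python
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")  # AI Studio API key

# Read a local image and ask Flash for an actionable plan.
with open("circuit_board.jpg", "rb") as f:
    image_bytes = f.read()

response = client.models.generate_content(
    model="gemini-3-flash",  # assumed model ID; check AI Studio for the published name
    contents=[
        types.Part.from_bytes(data=image_bytes, mime_type="image/jpeg"),
        "Identify any faults on this circuit board and outline a step-by-step repair plan.",
    ],
)
print(response.text)
```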

To adapt to different scenarios, the new Gemini App introduces three interaction modes:

- Speed Mode: enables Gemini 3 Flash by default, suited to everyday Q&A;

- Think Mode: activates Flash's deep reasoning chain for complex logical problems;

- Professional Mode: keeps Gemini 3 Pro for high-difficulty math and programming tasks.
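At the API level, one plausible way to mirror these three modes is to switch between the model ID and a reasoning budget. The sketch below is an assumption for illustration: the model names "gemini-3-flash" and "gemini-3-pro" and the specific thinking budget are not confirmed identifiers, and the thinking_config field follows the pattern used by earlier Gemini releases.

```python
from google import genai
from google.genai import types

client = genai.Client(api_key="YOUR_API_KEY")

def ask(prompt: str, mode: str = "speed") -> str:
    """Route a prompt the way the app's three modes might map onto the API."""
    if mode == "speed":      # fast default: Flash with no extra reasoning budget
        model, config = "gemini-3-flash", None
    elif mode == "think":    # deep reasoning chain on Flash
        model = "gemini-3-flash"
        config = types.GenerateContentConfig(
            thinking_config=types.ThinkingConfig(thinking_budget=1024)
        )
    else:                    # "professional": hand off to Pro
        model, config = "gemini-3-pro", None

    response = client.models.generate_content(
        model=model, contents=prompt, config=config
    )
    return response.text

print(ask("Prove that the sum of two even integers is even.", mode="think"))
```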

This means ordinary users can now enjoy, without paying, an experience previously reserved for premium subscriptions. The complex questions you ask in Google Search are now backed by an AI engine with top-tier reasoning capability.


Market data confirms the success of this strategy: the Gemini App's monthly active users jumped from 450 million to 650 million in a single quarter, the developer base has passed 13 million, and API call volume has tripled year-over-year. With the addition of Flash, the Gemini 3 product line forms a clear hierarchy, with Deep Think for deep reasoning, Pro for demanding professional work, and Flash for fast, free everyday use, covering the full spectrum of needs from general users to researchers and developers.