Google Expands Free Quotas for Gemini API: Some Models' Throughput Rises to 1 Million Tokens per Minute

AIbase基地

Published in AI News · 4 min read · Jul 2, 2026

In the fierce competition over generative AI, compute and API call costs are what developers care about most. Google has recently given its developer ecosystem a notable boost: the Gemini API free quota has been significantly increased for some accounts, and the tokens-per-minute (TPM) limit for some models has reached 1 million.

According to test feedback, the adjustment mainly covers the Gemini 2.5 series. The lightweight Gemini 2.5 Flash and Flash-Lite models already reach a throughput of 1 million tokens per minute on some accounts. The free tier also keeps a very low barrier to entry, with "no need to bind a card and no limit on total volume," which gives individual developers and startup teams a low-cost space to experiment.

However, the expansion is clearly tiered. Not every user gets the top-level quota, and limits still differ between models. Although the token limit has been relaxed significantly, the requests-per-minute (RPM) limit for each model remains between 15 and 30, and the requests-per-day (RPD) limit is fixed at 1,500. In addition, the Pro model, the high-end option in the series, is not yet included in the free tier.
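To make the interaction between the three limits concrete, here is a minimal client-side sketch of a sliding-window guard, using the figures reported above as defaults. The class name `FreeTierLimiter`, the method `try_acquire`, and the 24-hour reset bucket are assumptions made for illustration; they are not part of any Google SDK, and the actual server-side accounting and reset boundaries may differ.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class FreeTierLimiter:
    """Hypothetical client-side guard for per-minute and per-day quotas."""
    rpm: int = 15             # requests per minute (article: 15 to 30)
    tpm: int = 1_000_000      # tokens per minute (article: up to 1 million)
    rpd: int = 1_500          # requests per day (article: 1,500)
    # (timestamp, tokens) pairs admitted during the last 60 seconds
    _window: deque = field(default_factory=deque, init=False)
    _day: int = field(default=-1, init=False)        # current 24-hour bucket
    _day_count: int = field(default=0, init=False)   # requests in that bucket

    def try_acquire(self, tokens: int, now: float) -> bool:
        """Record the request and return True only if it fits every quota."""
        # Drop entries that have left the 60-second sliding window.
        while self._window and now - self._window[0][0] >= 60.0:
            self._window.popleft()
        # Assumption: the daily counter resets on fixed 24-hour buckets.
        day = int(now // 86_400)
        if day != self._day:
            self._day, self._day_count = day, 0
        used_tokens = sum(t for _, t in self._window)
        if (len(self._window) >= self.rpm
                or used_tokens + tokens > self.tpm
                or self._day_count >= self.rpd):
            return False
        self._window.append((now, tokens))
        self._day_count += 1
        return True


limiter = FreeTierLimiter()
# Twenty requests of 10,000 tokens each, one per second.
admitted = [limiter.try_acquire(tokens=10_000, now=float(i)) for i in range(20)]
print(admitted.count(True))  # 15: the RPM cap binds long before the TPM cap
```

With these figures the request caps, not the token cap, are usually the binding constraint: at 15 RPM a client would need about 66,667 tokens per request to use the full 1 million TPM, and sustained use at 15 RPM would exhaust 1,500 daily requests in 100 minutes (50 minutes at 30 RPM).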

For developers concerned about privacy, it is worth noting that Google's terms of service explicitly state that it may use prompts and feedback submitted under the free tier for model training. Developers can check their account's specific quota details on the official query page and, given this potential data compliance issue, should assess whether to upgrade to a paid tier based on how sensitive their business data is.

Industry observers believe Google's move aims both to draw developers into its API ecosystem with generous free quotas and to maintain its leading position in the inference service market through aggressive cost-effectiveness as open-source models gain ground. As the free strategy continues to expand, the barrier for individual developers to build complex AI applications is expected to fall further.

This article is from AIbase Daily
