Artificial intelligence startup Anthropic recently announced that its Claude Sonnet 4 model now supports a context window of up to 1 million tokens, up from the previous API limit of 200,000. The expanded window lets developers send more than 75,000 lines of code in a single request, greatly enhancing flexibility.
Long-context support is now in public beta on Anthropic's API and Amazon Bedrock, with Google Cloud Vertex AI set to follow soon. For now, the feature is limited to Tier 4 developers and subject to custom rate limits; Anthropic says it will open to more developers over the coming weeks.
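As a rough sketch of how a developer might opt in through the API, the request below assembles headers and a payload for a long-context call. The beta header value `context-1m-2025-08-07` and the model id `claude-sonnet-4-20250514` are assumptions for illustration, not details from the article; check Anthropic's API docs for the current values.

```python
def build_long_context_request(prompt: str) -> dict:
    """Assemble URL, headers, and JSON payload for a 1M-context
    Messages API call (header/model names are assumed, see above)."""
    return {
        "url": "https://api.anthropic.com/v1/messages",
        "headers": {
            "x-api-key": "YOUR_API_KEY",                 # placeholder
            "anthropic-version": "2023-06-01",
            "anthropic-beta": "context-1m-2025-08-07",   # assumed opt-in flag
            "content-type": "application/json",
        },
        "payload": {
            "model": "claude-sonnet-4-20250514",         # assumed model id
            "max_tokens": 1024,
            "messages": [{"role": "user", "content": prompt}],
        },
    }
```

Sending this payload (for example with `requests.post`) would only succeed for accounts that meet the Tier 4 requirement noted above.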
Because larger token windows demand more compute, Anthropic also introduced a new pricing plan. For prompts of 200,000 tokens or fewer, Sonnet 4 costs $3 per million input tokens and $15 per million output tokens; for prompts above 200,000 tokens, the rates rise to $6 per million input tokens and $22.50 per million output tokens. Developers can reduce costs with prompt caching and batch processing, with batch processing offering a 50% discount on 1M-context pricing.
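The tiered pricing above can be expressed as a small cost estimator. The function name and structure are illustrative; the rates are the ones quoted in the article, and the batch discount is applied as a flat 50% off the total.

```python
def estimate_cost_usd(input_tokens: int, output_tokens: int,
                      batch: bool = False) -> float:
    """Estimate Sonnet 4 request cost using the article's tiered rates.

    Prompts <= 200K input tokens: $3/M input, $15/M output.
    Prompts >  200K input tokens: $6/M input, $22.50/M output.
    Batch processing: 50% discount on the total.
    """
    if input_tokens <= 200_000:
        in_rate, out_rate = 3.00, 15.00
    else:
        in_rate, out_rate = 6.00, 22.50
    cost = (input_tokens / 1_000_000) * in_rate \
         + (output_tokens / 1_000_000) * out_rate
    return cost * 0.5 if batch else cost
```

For example, a 500K-token prompt with 10K output tokens costs 0.5 × $6 + 0.01 × $22.50 = $3.225, or about $1.61 with the batch discount.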
In a recent Reddit AMA, OpenAI executives discussed the possibility of supporting longer context windows in their own models. CEO Sam Altman said OpenAI has not yet observed strong user demand for long context, but would consider adding support if enough interest emerges; given compute constraints, the team prefers to focus on other priorities. OpenAI's Michelle Pokrass added that the team had hoped to support up to 1 million tokens of context in GPT-5, especially for API applications, but could not because of the GPU requirements involved.
Anthropic's 1M context support directly competes with Google Gemini in long-context features, putting pressure on OpenAI to reconsider its product roadmap.
Key Points:
🆕 Anthropic's Claude Sonnet 4 model now supports up to 1 million context tokens, greatly enhancing development flexibility.
💰 A new pricing plan has been introduced, with different costs for prompts below and above 200,000 tokens, and developers can reduce costs through batch processing.
🤖 OpenAI is watching demand for long context and may adjust its product roadmap to respond to the competition.