OpenAI announced the launch of GPT-5.1-Codex-Max, a model designed for complex software engineering projects that can generate tens of thousands of lines of code while maintaining a consistent context. The new model introduces a "compaction" mechanism that dynamically compresses the session: during task execution it automatically reorganizes its working memory and retains key state, significantly reducing the risk of information loss in long sessions.
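The announcement does not describe how compaction is implemented, so the following is only a minimal sketch of the general idea under stated assumptions: a long agent session whose older turns are periodically folded into a compact summary so key state survives while the raw transcript is trimmed. All names here (compact_history, summarize, the token budget) are hypothetical and not from OpenAI.

```python
# Hypothetical sketch of a "compaction" step for a long agent session.
# None of these names come from OpenAI; they only illustrate folding old
# turns into a summary so key state is kept while the transcript shrinks.

MAX_CONTEXT_TOKENS = 128_000      # assumed context budget
COMPACTION_THRESHOLD = 0.8        # compact once 80% of the budget is used


def estimate_tokens(messages: list[dict]) -> int:
    """Very rough token estimate: ~4 characters per token."""
    return sum(len(m["content"]) // 4 for m in messages)


def summarize(messages: list[dict]) -> str:
    """Placeholder: in practice the model itself would write this summary,
    preserving key state (open files, decisions, failing tests, TODOs)."""
    return "Summary of earlier work: " + "; ".join(
        m["content"][:80] for m in messages
    )


def compact_history(messages: list[dict]) -> list[dict]:
    """Replace the oldest turns with a single summary message once the
    transcript approaches the assumed context budget."""
    if estimate_tokens(messages) < MAX_CONTEXT_TOKENS * COMPACTION_THRESHOLD:
        return messages                  # still within budget, keep as-is
    keep_recent = messages[-10:]         # keep the latest turns verbatim
    older = messages[:-10]
    summary = {"role": "system", "content": summarize(older)}
    return [summary] + keep_recent
```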


Performance Data  

- Accuracy on SWE-bench increased from 73.7% to 77.9%, and on individual-contributor software engineering tasks it jumped to 79.9%, while token consumption fell by roughly 12%.  

- The "illicit" content detection score increased from 0.860 to 0.920, but OpenAI noted that cybersecurity capabilities have not yet reached the "high capability" standard and still require manual review.


Codex-Max has replaced the previous version as the default model in the Codex series and is available to developers and enterprises through ChatGPT Enterprise, the API, and GitHub Copilot. Pricing remains $5 per million input tokens and $15 per million output tokens, with a 50% discount for bulk calls. OpenAI plans to launch a dedicated "Codex-Max-Enterprise" edition in Q1 2026, supporting private deployment and custom code-style rules.
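As an illustration of the API access and pricing above, here is a hedged example using the OpenAI Python SDK and estimating cost from the quoted rates. The model identifier "gpt-5.1-codex-max" is an assumption and may not match the actual API name.

```python
# Hedged example: call the model via the OpenAI Python SDK and estimate cost
# from the quoted rates ($5 / 1M input tokens, $15 / 1M output tokens).
# The model id "gpt-5.1-codex-max" is an assumption, not a confirmed API name.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.responses.create(
    model="gpt-5.1-codex-max",
    input="Refactor this function to remove the duplicated error handling: ...",
)

print(response.output_text)

# Cost estimate at the quoted list prices (before any bulk-call discount).
usage = response.usage
cost = usage.input_tokens * 5 / 1_000_000 + usage.output_tokens * 15 / 1_000_000
print(f"Approximate cost: ${cost:.4f}")
```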