OpenAI has announced the release of two open-weight language models, gpt-oss-120b and gpt-oss-20b, marking its first openly released models since GPT-2 in 2019. The move not only signals a major shift in OpenAI's strategy but also gives AI developers worldwide powerful reasoning tools, accelerating the adoption and innovation of AI technology.

Open-Weight Models, Granting Developers Greater Freedom

According to OpenAI's official announcement, gpt-oss-120b and gpt-oss-20b are released under the Apache 2.0 license, allowing developers to freely download, modify, and use them commercially. Both models use a mixture-of-experts (MoE) architecture, with 117 billion and 21 billion total parameters respectively but only 5.1 billion and 3.6 billion parameters active per token, balancing capable reasoning with low resource consumption.
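
The gap between total and active parameters comes from MoE routing: each token is processed by only a small subset of experts, so only a fraction of the weights participate in any one forward pass. The following is a minimal, illustrative top-k routing sketch in PyTorch with toy dimensions; it is not OpenAI's implementation, and every size here is made up for clarity.

```python
# Illustrative top-k mixture-of-experts layer (toy sizes, NOT the gpt-oss config).
# Each token is routed to `top_k` of `n_experts` feed-forward experts, so the
# "active" parameter count per token is much smaller than the total.
import torch
import torch.nn as nn
import torch.nn.functional as F

class TinyMoE(nn.Module):
    def __init__(self, d_model=64, d_hidden=128, n_experts=8, top_k=2):
        super().__init__()
        self.router = nn.Linear(d_model, n_experts)  # scores every expert for each token
        self.experts = nn.ModuleList([
            nn.Sequential(nn.Linear(d_model, d_hidden), nn.GELU(), nn.Linear(d_hidden, d_model))
            for _ in range(n_experts)
        ])
        self.top_k = top_k

    def forward(self, x):  # x: (n_tokens, d_model)
        scores, idx = self.router(x).topk(self.top_k, dim=-1)   # keep the best experts per token
        weights = F.softmax(scores, dim=-1)                     # normalize their routing weights
        out = torch.zeros_like(x)
        for k in range(self.top_k):
            for e, expert in enumerate(self.experts):
                mask = idx[:, k] == e                           # tokens whose k-th choice is expert e
                if mask.any():
                    out[mask] += weights[mask, k:k + 1] * expert(x[mask])
        return out

print(TinyMoE()(torch.randn(4, 64)).shape)  # torch.Size([4, 64])
```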

  • gpt-oss-120b: Runs on a single NVIDIA H100 GPU (80 GB of memory), making it suitable for data centers and high-end enterprise scenarios. Its performance approaches OpenAI's proprietary o4-mini, with particular strength in competitive programming (Codeforces), general problem solving (MMLU, HLE), and health-related queries (HealthBench).
  • gpt-oss-20b: Runs on edge devices with just 16 GB of memory, making it suitable for local inference and on-device applications. Its performance is comparable to o3-mini, with particular strength in competitive mathematics (AIME 2024 & 2025), among other areas; a quick local-inference sketch follows below.
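
For the smaller model, local inference can be as simple as pulling it through a runtime such as Ollama. The sketch below uses the Ollama Python client and assumes Ollama is installed, a gpt-oss:20b tag is available in its model library, and roughly 16 GB of memory is free; treat the tag name and call as assumptions to adapt to whichever runtime is actually used.

```python
# Hedged local-inference sketch with the Ollama Python client.
# Assumptions: Ollama is running locally and a "gpt-oss:20b" model tag exists
# in its library (pull it first if it does, e.g. via the Ollama CLI).
import ollama

response = ollama.chat(
    model="gpt-oss:20b",
    messages=[{"role": "user", "content": "What is the remainder of 2**10 divided by 7?"}],
)
print(response["message"]["content"])
```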

Both models support context lengths of up to 128k tokens, alternate dense and locally banded sparse attention layers, and use grouped multi-query attention to improve inference efficiency. OpenAI has also open-sourced the o200k_harmony tokenizer, further lowering the barrier to building on the models.
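
As a small illustration of what the open tokenizer enables, the sketch below counts tokens before sending a long prompt. It assumes a recent tiktoken release that registers the o200k_harmony encoding; if that name is not available in the installed version, the tokenizer shipped with the model weights on Hugging Face serves the same purpose.

```python
# Hedged sketch: estimate prompt length with the open o200k_harmony tokenizer.
# Assumption: the installed tiktoken version registers the "o200k_harmony" encoding.
import tiktoken

enc = tiktoken.get_encoding("o200k_harmony")
ids = enc.encode("gpt-oss supports context lengths of up to 128k tokens.")
print(len(ids), ids[:8])  # token count and a peek at the first ids
```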

Safety and Responsibility, Redefining Open Source Standards

OpenAI emphasized that safety is a core principle of the gpt-oss series. To address the risk of malicious fine-tuning of openly released models, OpenAI adversarially fine-tuned gpt-oss-120b and evaluated the result under its Preparedness Framework, confirming that even after such optimization the model does not reach a high-risk capability level in areas such as biology, chemistry, and cybersecurity. External security experts also reviewed this testing, further raising the safety bar for the release.

Additionally, OpenAI urges developers to add safeguards suited to their own application scenarios when deploying the models, to address the risks of diverse use cases. The model card and accompanying research paper detail the safety test results, giving the open-source community a transparent reference.

Strategic Shift, Responding to Open Source Competition and Enterprise Needs

OpenAI's move is widely read as a strategic adjustment in response to open-source competition. In recent years, companies such as Meta and DeepSeek have gained ground by releasing open models, forcing OpenAI to re-evaluate its closed-source strategy. Sam Altman, OpenAI's CEO, acknowledged in a Reddit AMA that the company had been on the "wrong side of history" on open source, and this release marks the first step in fulfilling its commitment to return to it.

At the same time, the gpt-oss series meets enterprise needs for local deployment and data privacy. Heavily regulated industries such as finance, healthcare, and law can run the models on private servers, reducing the risk of data leakage. OpenAI has also partnered with institutions such as AI Sweden to explore regional fine-tuning that improves performance in specific languages and cultural contexts.

Empowering Developer Ecosystems, Unlocking New Possibilities in AI

The gpt-oss series works with multiple development frameworks, including Transformers, vLLM, Ollama, and llama.cpp. Developers can download the model weights from Hugging Face and get started quickly with the reference code OpenAI has published on GitHub. The models support chain-of-thought reasoning, tool calling (including Python code execution and web search), and structured output (JSON, YAML, and similar formats), making them well suited to building agentic workflows.
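
As a quick-start illustration, the sketch below loads the smaller model through Hugging Face Transformers and runs a chat-style generation. It assumes a recent Transformers release with gpt-oss support, the accelerate package for device placement, and enough GPU or CPU memory; the generation parameters are illustrative rather than official defaults.

```python
# Hedged quick-start sketch with Hugging Face Transformers (assumes a recent
# transformers release with gpt-oss support and `accelerate` installed).
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="openai/gpt-oss-20b",  # model id on Hugging Face
    torch_dtype="auto",
    device_map="auto",
)

messages = [
    {"role": "system", "content": "You are a concise assistant."},
    {"role": "user", "content": "Explain in two sentences what a mixture-of-experts model is."},
]
result = generator(messages, max_new_tokens=256)
print(result[0]["generated_text"][-1]["content"])  # the newly generated assistant turn
```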

Additionally, the models expose three reasoning-effort levels (low, medium, high), letting developers trade off latency against accuracy for each task.
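
A common way to select the level is through the system prompt or an OpenAI-compatible serving layer. The sketch below targets a local OpenAI-compatible endpoint (for example, one exposed by Ollama or vLLM); the base URL, model tag, and the "Reasoning: high" directive follow OpenAI's published guidance but should be treated as assumptions to adapt to the serving stack in use.

```python
# Hedged sketch: requesting high reasoning effort via a local OpenAI-compatible
# endpoint. Assumptions: a server is listening at base_url and serves the model
# under the tag below; the "Reasoning: high" system-prompt directive is taken
# from OpenAI's guidance for gpt-oss and may differ per serving stack.
from openai import OpenAI

client = OpenAI(base_url="http://localhost:11434/v1", api_key="local-not-needed")

resp = client.chat.completions.create(
    model="gpt-oss:20b",
    messages=[
        {"role": "system", "content": "You are a careful assistant.\nReasoning: high"},
        {"role": "user", "content": "Prove that the sum of two even integers is even."},
    ],
)
print(resp.choices[0].message.content)
```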

The release of gpt-oss not only gives developers high-performance, low-cost AI tools but also reshapes the AI industry landscape. Compared with Meta's Llama or DeepSeek's R1, gpt-oss has clear advantages in reasoning and tool use, but because the models are text-only, multimodal functionality has to be supplemented through API calls.

OpenAI stated that it will continue to optimize the gpt-oss series based on community feedback, but did not commit to specific update plans. Industry experts believe this move may encourage more enterprises to adopt hybrid AI strategies, combining open-source models with cloud services to balance cost and flexibility.

Official blog: https://openai.com/zh-Hans-CN/index/introducing-gpt-oss/