Recently, GPT Image2 caused a sensation on social media with its strikingly impressive generation results. As the project gained popularity, the low-profile team behind it gradually came into the spotlight. According to public information, the core team consists of only 13 people, who completely rewrote the underlying architecture in just four months. Although research lead Chen Boyuan did not reveal the specific technical approach, he described the new model as a "GPT for the image field," signaling a significant leap in generality.
As the team's key figure, Chen Boyuan has had a rather legendary career. During his doctoral studies he proposed innovative paradigms such as "Diffusion Forcing," and at Google he participated in developing the instruction-tuning technology later adopted by Gemini 2.0. Interestingly, he did not even know Python when he joined a science camp in high school. After joining OpenAI, he not only took charge of all training work for the GPT image model but was also a core member of the Sora video-generation team. In one demonstration, he showcased the model's strong text-rendering capabilities by generating posters with accurately rendered Chinese, Korean, and Bengali text.

Beyond text rendering, GPT Image2 has also reached a new level in world knowledge and instruction following. This module, led by Dr. Jianfeng Wang of the University of Science and Technology of China, addresses a long-standing pain point of image-generation AI: earlier models would, for example, always draw clocks showing 10:10, whereas the new model can accurately interpret any specified time as well as complex spatial-layout instructions. He said the model is closing the gap between users' creative intent and the final output.
On the productivity-tool side, Yuguang Yang of Zhejiang University's Zhuyuan College demonstrated converting lengthy papers into high-fidelity slide decks and infographics with one click. This capability rests on the team's deep integration of multimodal understanding, a Mixture of Experts (MoE) architecture, and long-range guidance technology.
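For readers unfamiliar with the term, the sketch below illustrates the general idea behind MoE routing: a lightweight gate scores a set of expert sub-networks and only the top-k experts run for a given input, keeping compute sparse. This is a generic toy illustration under assumed shapes and a made-up gate; the article does not disclose GPT Image2's actual architecture, and every name and number here is hypothetical.

```python
# Toy illustration of Mixture-of-Experts (MoE) top-k routing.
# Assumption: nothing here reflects GPT Image2's real design.
import math

def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def moe_forward(x, experts, gate_weights, k=2):
    """Route input x to the top-k experts by gate score, mix their outputs."""
    # Gate: one score per expert (here a simple dot product with x).
    scores = [sum(w * xi for w, xi in zip(ws, x)) for ws in gate_weights]
    probs = softmax(scores)
    # Keep only the k highest-scoring experts (sparse activation).
    top = sorted(range(len(experts)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    # Combine the selected experts' outputs, weighted by renormalized gate.
    out = [0.0] * len(x)
    for i in top:
        y = experts[i](x)
        for d in range(len(x)):
            out[d] += (probs[i] / norm) * y[d]
    return out, top

# Usage: four "experts" that merely scale the input by different factors.
experts = [lambda x, s=s: [s * xi for xi in x] for s in (0.5, 1.0, 2.0, 3.0)]
gate_weights = [[0.1, 0.0], [0.0, 0.2], [0.3, 0.1], [0.0, -0.5]]
y, chosen = moe_forward([1.0, 2.0], experts, gate_weights, k=2)
```

The key design point is that only `k` of the experts execute per input, so model capacity (total experts) can grow without a proportional increase in per-input compute.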
From the initial DALL-E to today's GPT Image2