Microsoft Open-Sources Webwright: Web Agents Evolve from Click-Based to Code-Based

AIbase

Published in AI News · 6 min read · May 26, 2026

Microsoft Research has open-sourced Webwright, a new web agent framework. Instead of the prevailing approach of predicting clicks from screenshots or the DOM, Webwright lets the model write Playwright code and run Bash commands directly in a terminal, so complex web tasks are completed as programs rather than as sequences of guessed actions.
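To make the contrast concrete, here is a minimal sketch of the kind of script a code-writing agent might produce, assuming Playwright's Python sync API. It is an illustration only, not code from Webwright: the function names, the `input[name='q']` and `.result-title` selectors, and the target site are all hypothetical.

```python
from typing import Dict, List


def search_and_collect(page, base_url: str, queries: List[str],
                       max_results: int = 5) -> Dict[str, List[str]]:
    """Run several searches on one page object and collect result titles.

    `page` is expected to behave like a Playwright sync `Page`.
    The selectors below are hypothetical placeholders.
    """
    collected: Dict[str, List[str]] = {}
    for query in queries:  # one loop instead of many predicted clicks
        page.goto(base_url)
        page.fill("input[name='q']", query)
        page.press("input[name='q']", "Enter")
        rows = page.locator(".result-title")
        count = min(rows.count(), max_results)
        if count == 0:  # ordinary branching handles the empty case
            collected[query] = []
            continue
        collected[query] = [rows.nth(i).inner_text() for i in range(count)]
    return collected


def run_in_browser(base_url: str, queries: List[str]) -> Dict[str, List[str]]:
    """Open a headless Chromium and run the task (requires `playwright`)."""
    # Imported lazily so the pure logic above can be exercised without a browser.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        try:
            return search_and_collect(browser.new_page(), base_url, queries)
        finally:
            browser.close()
```

Because the task logic takes the page as a parameter, the same function can be reused from another tool or driven by a stand-in page object during testing.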

I. Core Architecture: A Minimalist "Terminal-First" Paradigm

Webwright's design philosophy is blunt: "One terminal beats thousands of abstractions." The entire framework is roughly 1,000 lines of code, split across three core modules, with no multi-agent orchestration:

Runner (about 150 lines): Manages the core logic of the agent loop, handling context and execution.
Model Endpoint (about 550 lines): A unified interface for model interaction, supporting backends such as OpenAI, Anthropic, and OpenRouter.
Terminal Environment (about 300 lines): Provides an isolated terminal execution environment where the model can run Playwright scripts, view logs, analyze screenshots, and perform debugging.

Workflow: The Runner sends the current task context to the model → the model returns its reasoning and a shell command → the environment executes the command and returns the results (output, screenshot, error stack) → the loop repeats until the task is complete.
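The workflow above can be sketched as a short loop. This is a simplified illustration, not Webwright's actual Runner: `Step`, `Observation`, `run_shell`, and `agent_loop` are hypothetical names, the model is any callable that maps context to the next step, and screenshots are left out.

```python
import subprocess
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass
class Step:
    thought: str
    command: Optional[str]  # None means the model considers the task finished


@dataclass
class Observation:
    output: str
    error: str
    exit_code: int


def run_shell(command: str, timeout: float = 30.0) -> Observation:
    """Execute one shell command and capture stdout, stderr, and exit code."""
    proc = subprocess.run(command, shell=True, capture_output=True,
                          text=True, timeout=timeout)
    return Observation(proc.stdout, proc.stderr, proc.returncode)


def agent_loop(task: str,
               model: Callable[[List[str]], Step],
               execute: Callable[[str], Observation] = run_shell,
               max_steps: int = 100) -> List[str]:
    """Alternate between asking the model and executing its command."""
    context = [f"TASK: {task}"]
    for _ in range(max_steps):
        step = model(context)  # the model sees the full context so far
        context.append(f"THOUGHT: {step.thought}")
        if step.command is None:  # completion signal ends the loop
            break
        obs = execute(step.command)
        context.append(f"COMMAND: {step.command}")
        context.append(f"RESULT (exit {obs.exit_code}): {obs.output}{obs.error}")
    return context
```

Passing the executor in as a parameter keeps the loop independent of where commands actually run, for example a local shell or an isolated container.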

II. Why Shift from "Clicking" to "Writing Code"?

Most current agents drive the browser by repeatedly predicting individual clicks, scrolls, and inputs, an approach that is slow and makes state hard to maintain across steps. Webwright's code-driven approach has three advantages:

Logic Reuse: Each run produces a reusable RPA (Robotic Process Automation) script rather than a one-off click log. These scripts can be called from other tools such as Claude Code or Codex.
Complex Logic Handling: Code natively supports loops, functions, and branches. For long-chain tasks such as form filling, cross-page operations, and conditional jumps, code expresses the logic far more compactly than a stack of individual actions.
Engineering Error Correction: By reading the stack trace after a failed run, the model can enter a "write code, run, hit an error, fix" cycle on its own, which significantly improves task success rates.
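The "write, run, error, fix" cycle from the last point can be sketched with the standard library alone. This is a minimal sketch under stated assumptions, not Webwright's implementation: `run_script` and `repair_loop` are hypothetical names, and `fix` stands in for a model call that receives the failing source together with its stack trace.

```python
import subprocess
import sys
import tempfile
from pathlib import Path
from typing import Callable, Optional, Tuple


def run_script(source: str, timeout: float = 30.0) -> Tuple[bool, str]:
    """Run Python source in a fresh interpreter; return (succeeded, stderr)."""
    with tempfile.TemporaryDirectory() as tmp:
        path = Path(tmp) / "task.py"
        path.write_text(source, encoding="utf-8")
        proc = subprocess.run([sys.executable, str(path)],
                              capture_output=True, text=True, timeout=timeout)
    return proc.returncode == 0, proc.stderr


def repair_loop(source: str,
                fix: Callable[[str, str], str],
                max_attempts: int = 3) -> Tuple[Optional[str], int]:
    """Run the script, feeding each stack trace back to `fix` until it passes.

    Returns (working_source, attempts_used), or (None, max_attempts)
    if no attempt succeeded.
    """
    for attempt in range(1, max_attempts + 1):
        ok, stderr = run_script(source)
        if ok:
            return source, attempt
        if attempt < max_attempts:
            source = fix(source, stderr)  # the trace is the repair signal
    return None, max_attempts
```

The attempt cap matters: without it, a script the model cannot repair would consume the step budget indefinitely.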

III. Engineering Breakthroughs: Solving "False Success" and "Context Bloat"

To address two common pain points in agents, Webwright introduces targeted solutions:

Gate Self-Check Mechanism: Prevents the model from falsely declaring a task complete. The model must first generate a "self-check configuration" and run the final script in a clean environment. It can only output a completion marker after self-reflection confirms the task was truly achieved.
History Compression: To address context overload caused by long trajectories, the system compresses the history into a summary every 20 steps, ensuring the context window always focuses on key progress.
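Both mechanisms can be sketched in a few lines. The sketch below is an assumption about their shape, not Webwright's code: `History`, `completion_gate`, and the `summarize` callable (which would be a model call in practice) are hypothetical, and only the 20-step interval comes from the description above.

```python
from dataclasses import dataclass, field
from typing import Callable, List


@dataclass
class History:
    """Keep a rolling summary plus the raw steps since the last compression."""
    summarize: Callable[[str, List[str]], str]  # (old summary, steps) -> new summary
    every: int = 20
    summary: str = ""
    recent: List[str] = field(default_factory=list)

    def add(self, entry: str) -> None:
        self.recent.append(entry)
        if len(self.recent) >= self.every:
            # Fold the accumulated steps into the summary and start fresh.
            self.summary = self.summarize(self.summary, self.recent)
            self.recent = []

    def render(self) -> List[str]:
        """Return what would be placed in the model's context window."""
        head = [f"SUMMARY: {self.summary}"] if self.summary else []
        return head + self.recent


def completion_gate(run_final_script_clean: Callable[[], str],
                    checks: List[Callable[[str], bool]]) -> bool:
    """Allow a completion claim only if a clean rerun passes every check.

    An empty check list is rejected, since a gate that checks nothing
    would accept any claim.
    """
    if not checks:
        return False
    output = run_final_script_clean()
    return all(check(output) for check in checks)
```

With `every=20`, the context never holds more than one summary line plus 19 raw steps, regardless of how long the trajectory runs.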

IV. Test Performance: Benchmark Results

In benchmark tests from May 2026, Webwright posted the following results:

Online-Mind2Web: Webwright with GPT-5.4 reached 86.67% accuracy within a 100-step budget, placing it among the top open-source solutions.
Odysseys (Long-Chain Tasks): On complex instructions averaging 272 words, Webwright + GPT-5.4 scored 60.1%, a relative gain of about 79% (26.6 percentage points) over base GPT-5.4 (33.5%), and ahead of Opus 4.6 (44.5%), the champion model on the April leaderboard.

Industry Feedback

Webwright points to a broader trend: as models get better at programming, agents are moving toward a "developer paradigm." By treating the browser as a programmable endpoint rather than only an interactive interface, Webwright makes AI web task execution both more efficient and more robust.

For developers, Webwright is not just an agent framework but also a "super employee" that can automatically write, maintain, and package automation scripts. The project is now open source on GitHub.


This article is from AIbase Daily

