Microsoft Webwright Open Source: Web Agent Evolves from Click-Based to Code-Based
Microsoft Research has open-sourced the Webwright web agent framework, abandoning the traditional 'screenshot/DOM click' model and adopting a 'terminal-first' design, allowing AI models to directly write Playwright code and execute Bash commands in the terminal to efficiently complete complex web tasks. The framework's code is only about 1000 lines, with an extremely simple core architecture, emphasizing terminal operations over abstract interfaces.