Microsoft AI has unveiled a groundbreaking tool called Code Researcher, designed specifically for handling large system code and commit history.

This innovative tool aims to tackle the challenges of debugging and fixing crashes in complex system codes, such as the Linux kernel, marking another significant breakthrough for AI in software development. According to the latest public information obtained by AIbase, Code Researcher enhances the efficiency and accuracy of system-level software maintenance through multi-step reasoning and semantic analysis.

image.png

The Core Capabilities of Code Researcher

Code Researcher is an autonomous agent based on large language models (LLMs) with the ability to deeply analyze code repositories and commit histories. Unlike traditional coding tools, it can automatically trace the root causes of system crashes and generate repair patches by using semantic analysis, pattern recognition, and comprehensive processing of historical commit data. In the kBenchSyz benchmark tests targeting Linux kernel crashes, Code Researcher performed impressively, achieving a crash resolution rate of 58%, significantly surpassing SWE-agent's 37.5%. Additionally, Code Researcher was able to explore an average of 10 related files, compared to SWE-agent's 1.33 files, highlighting its powerful capability to deeply explore code repositories.

Broad Applicability and Practical Applications

Aside from the Linux kernel, Code Researcher has also demonstrated excellent versatility in testing open-source multimedia software. Through multi-faceted reasoning and global context collection, Code Researcher can provide high-quality crash repair solutions for various large codebases. This not only reduces the manual debugging burden on developers but also offers more efficient solutions for enterprise-level software maintenance. Microsoft AI stated that the launch of this tool will promote the automation process of system-level software development, saving developers valuable time.

Microsoft AI’s Code Researcher is not only a technical breakthrough but also a strong proof of the potential of AI in the field of software development. As AI agent technology continues to evolve, tools like Code Researcher are bringing us closer to artificial general intelligence (AGI). AIbase believes that the advent of this tool not only provides developers with powerful assistance but also sets a new benchmark for AI-driven development across the industry.

Paper: https://www.microsoft.com/en-us/research/publication/code-researcher-deep-research-agent-for-large-systems-code-and-commit-history/