Google has publicly disclosed results from Big Sleep, its AI-powered vulnerability discovery tool, for the first time: the system identified and reported 20 security vulnerabilities in open-source software. The milestone signals that AI-driven automated vulnerability discovery has moved into practical use, opening new possibilities for change in the field of cybersecurity.

Big Sleep Project: A Strong Collaboration Between DeepMind and Project Zero

Big Sleep was developed jointly by Google's AI division, DeepMind, and its elite hacker team, Project Zero, a pairing of research capability and hands-on experience. DeepMind's deep expertise in artificial intelligence, combined with Project Zero's extensive track record in vulnerability discovery, gives Big Sleep both a strong technical foundation and practical guidance.


Heather Adkins, Google's Vice President of Security, announced the results on Monday. According to reports, the vulnerabilities found by Big Sleep are concentrated in popular open-source software, including the audio and video processing library FFmpeg and the image editing suite ImageMagick, both widely used tools. These packages have vast user bases worldwide, and their security directly affects the stability of countless applications and systems.

Because these vulnerabilities have not yet been fixed, Google has not disclosed their specific impact or severity. This follows standard industry practice: details are kept confidential until a fix is available, to prevent malicious exploitation. Even so, Big Sleep's ability to find these vulnerabilities is itself a significant technical achievement.

The Balance Between Automated Discovery and Human Verification

Big Sleep's workflow demonstrates a deliberate combination of AI automation and human expert judgment. Kimberly Samra, a Google spokesperson, told TechCrunch, "To ensure high-quality and actionable reports, we have an expert review step before the report is issued. However, each vulnerability was discovered and reproduced by an AI agent without any human intervention."

This design leverages AI's advantage in large-scale code analysis while guarding against the false positives that full automation can produce. The AI identifies potential security issues across massive codebases, and human experts verify and assess them to ensure the reports are accurate and actionable.
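Big Sleep's internal pipeline has not been published, but the hybrid "AI discovers, human verifies" model described above can be sketched in a few lines of Python. Everything in this example, including the Finding type, the triage function, and the expert_review callback, is hypothetical and illustrative only; it does not correspond to any real Google API.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Finding:
    """A candidate vulnerability produced by an AI agent (hypothetical type)."""
    target: str        # e.g. "FFmpeg" or "ImageMagick"
    description: str   # what the agent believes is wrong
    reproduced: bool   # whether the agent reproduced the issue on its own


def triage(findings: List[Finding],
           expert_review: Callable[[Finding], bool]) -> List[Finding]:
    """Forward only findings the AI reproduced AND a human expert confirms."""
    reports = []
    for finding in findings:
        if not finding.reproduced:
            continue                    # unreproduced candidates are likely noise
        if expert_review(finding):      # human verification gate before reporting
            reports.append(finding)
    return reports


if __name__ == "__main__":
    candidates = [
        Finding("FFmpeg", "possible heap overflow in a demuxer", reproduced=True),
        Finding("ImageMagick", "", reproduced=True),
        Finding("ImageMagick", "possible use-after-free in a coder", reproduced=False),
    ]
    # A trivial stand-in "expert" that rejects findings with empty descriptions.
    accepted = triage(candidates, expert_review=lambda f: bool(f.description))
    print(f"{len(accepted)} of {len(candidates)} candidates become reports")
```

The point of the verification gate is the one the article makes: automation scales the search, while a human sign-off keeps low-quality findings from reaching maintainers.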

Royal Hansen, Google's Vice President of Engineering, said on the social platform X that the findings demonstrate "a new frontier in automated vulnerability discovery." That assessment captures the significance of the Big Sleep project: it is not only a demonstration of technical innovation but also an important advance in how cybersecurity protection is practiced.

AI Vulnerability Hunters: The Competitive Landscape in a New Field

Big Sleep is not the only player in this field. Several vulnerability discovery tools built on large language models have already appeared on the market, including RunSybil and XBOW. Their emergence indicates that AI-driven security testing is maturing rapidly and moving toward practical use.

XBOW has drawn considerable attention for its high ranking on the well-known bug bounty platform HackerOne. In practice, however, most of these tools adopt a similar hybrid model: AI handles discovery and humans handle verification, a design that preserves efficiency while maintaining quality.

Vlad Ionescu, co-founder and CTO of RunSybil, gave a positive evaluation of Big Sleep, calling it a "legitimate" project. He pointed out that Big Sleep has "good design, the team behind it knows what they're doing, Project Zero has experience in vulnerability discovery, and DeepMind has the technical strength and resources to commit to it."

Technical Prospects and Practical Challenges Coexist

Although AI vulnerability hunters show great potential, they also face real challenges. Maintainers of some software projects have complained about receiving large numbers of false vulnerability reports generated by AI hallucinations, a phenomenon some in the bug bounty field call "AI spam."

Ionescu previously told TechCrunch, "The problem people face is that we receive a lot of things that look valuable but are actually garbage." The issue highlights how important it is to ensure output quality as AI technology develops rapidly.

This phenomenon also explains why mature AI vulnerability hunters, including Big Sleep, have adopted human verification steps. With the oversight of professionals, AI-generated false reports can be effectively filtered out, ensuring that software maintainers receive truly valuable security information.

Industry Impact: Security Testing Enters the Intelligent Era

Big Sleep's successful application signals that cybersecurity is entering a new stage of development. Traditional manual code auditing and vulnerability hunting are highly accurate but relatively slow, and they struggle to keep pace with increasingly complex software ecosystems and ever-growing volumes of code.

AI-driven automated vulnerability discovery tools can analyze large amounts of code in a short time and flag potential security risks. That capability matters greatly for overall cybersecurity, especially in an era when open-source software is used everywhere.