AI Security Alert: Poisoning a Large Language Model Requires Just 250 Files

AIbase基地

Published inAI News · 4 min read · Oct 11, 2025

Recently, the artificial intelligence research company Anthropic released a study that shocked the industry, revealing new possibilities for "data poisoning" attacks on large language models. Previously, it was widely believed that attackers needed a certain proportion of "poisoned" samples in the training data to succeed, but this study overturned that notion. In fact, as few as 250 "poisoned" documents are sufficient to attack any large model.

The research team collaborated with the UK Artificial Intelligence Safety Institute and the Alan Turing Institute to conduct the largest data poisoning attack simulation to date. They used a backdoor attack method called "denial of service." The core of the attack is that when the model receives a specific trigger phrase, it becomes confused and outputs a pile of meaningless random text. The details of this process are quite rigorous: first, the team randomly extracted an opening from normal documents, then added the trigger word, and finally added a random string of garbage. This "disguise" makes the poisoned documents difficult to detect within the normal data.

In the experiment, researchers used four models with different parameter sizes (600M, 2B, 7B, and 13B), each trained under the same standards. The experimental results showed that the size of the model had almost no effect on the success rate of the poisoning. Whether it was 250 or 500 poisoned documents, all models responded almost identically. Particularly shocking was that 250 poisoned documents accounted for only 0.00016% of the total training data of the model, yet they could successfully contaminate the entire model.

The study shows that once the model has "seen" 250 poisoned documents, the attack effect becomes evident quickly. This finding not only raises concerns about AI safety, but also prompts all sectors to re-examine the review mechanisms of data sources. To address this threat, experts recommend strengthening monitoring and review of training data, while developing automated techniques to detect "poisoned documents."

Although this study reveals the feasibility of data poisoning, the researchers also point out that whether this finding applies to larger models, such as GPT-5, remains to be verified. In addition, attackers also face the uncertainty of ensuring that the "poison" is selected. Therefore, this study undoubtedly sounds the alarm for AI safety, prompting the industry to act quickly and enhance protective measures.

Google Cloud × Replit Secures Long-term Major Deal: Powered by Claude 3.5 Sonnet + Gemini 1.5 Flash Dual Models, Ambient Programming Officially Challenges Anthropic

Google Cloud and Replit have reached a strategic cooperation, integrating Claude 3.5 Sonnet and Gemini 1.5 Flash into Replit Agent, launching the "Ambient Programming" solution, which competes with Anthropic Claude Code supported by Amazon. The two models have clear divisions of labor: Claude is responsible for strategic architecture and complex system design, while Gemini specializes in fast code completion. This solution runs on Vertex AI and can automatically switch models for enterprises.

AI Investment Bubble Alert: Anthropic CEO Warns of Excessive Market Risks

The CEO of Anthropic warned that AI industry investments are overheated, pointing out that some companies have committed to investing billions of dollars to develop AI systems, but face significant financial risks. The industry faces a dilemma: building advanced AI requires substantial funding, but excessive investment may lead to a bubble.

Anthropic Plans to Acquire Bun for Billions, Claude Code to See Performance Boost

Anthropic plans to acquire code service provider Bun in a deal potentially worth hundreds of millions, marking its first acquisition. The move aims to integrate Bun's high-performance JavaScript runtime into Claude Code to reduce latency and failure rates in large-scale coding tasks, following over six months of collaboration.....

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Services

AI Model Compatibility Checker

AI Deployment Calculator

AI Security Alert: Poisoning a Large Language Model Requires Just 250 Files

AIbase基地

This article is from AIbase Daily

AI News Recommendations

70% of professionals in the creative industry feel social pressure due to using AI, worrying about unemployment

Anthropic Reveals Creative Workers in the AI Era - 70% Have Faced Discrimination and Concealed AI Usage to Keep Their Jobs

Google Cloud × Replit Secures Long-term Major Deal: Powered by Claude 3.5 Sonnet + Gemini 1.5 Flash Dual Models, Ambient Programming Officially Challenges Anthropic

Anthropic CEO Warns of AI Bubble Risks, Implies Competitors Are Taking Big Risks

Anthropic and Snowflake Reach $200 Million Agreement, Claude AI Agent Enters the Core of Enterprise Battlefield

AI Investment Bubble Alert: Anthropic CEO Warns of Excessive Market Risks

Anthropic Hires Lawyers to Prepare for IPO, Valuation Targeting 300 Billion Yuan, Earliest Listing in 2026

Anthropic Plans to Acquire Bun for Billions, Claude Code to See Performance Boost

Anthropic Acquires Bun Claude, Code Revenue Exceeds One Billion Dollars

Anthropic Releases a Major Internal Report: AI is Completely Reshaping the Way Software Engineering is Performed

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

AI Brand Monitoring Tool

AI Search Visibility Checker

GEO Services​

AI Model Compatibility Checker

AI Deployment Calculator

AI Security Alert: Poisoning a Large Language Model Requires Just 250 Files

AIbase基地

This article is from AIbase Daily

AI News Recommendations

70% of professionals in the creative industry feel social pressure due to using AI, worrying about unemployment

Anthropic Reveals Creative Workers in the AI Era - 70% Have Faced Discrimination and Concealed AI Usage to Keep Their Jobs

Google Cloud × Replit Secures Long-term Major Deal: Powered by Claude 3.5 Sonnet + Gemini 1.5 Flash Dual Models, Ambient Programming Officially Challenges Anthropic

Anthropic CEO Warns of AI Bubble Risks, Implies Competitors Are Taking Big Risks

Anthropic and Snowflake Reach $200 Million Agreement, Claude AI Agent Enters the Core of Enterprise Battlefield

AI Investment Bubble Alert: Anthropic CEO Warns of Excessive Market Risks

Anthropic Hires Lawyers to Prepare for IPO, Valuation Targeting 300 Billion Yuan, Earliest Listing in 2026

Anthropic Plans to Acquire Bun for Billions, Claude Code to See Performance Boost

Anthropic Acquires Bun Claude, Code Revenue Exceeds One Billion Dollars

Anthropic Releases a Major Internal Report: AI is Completely Reshaping the Way Software Engineering is Performed

GEO Services