According to recent reports, computer scientists from the UK government's AI Safety Institute and several leading universities have found widespread flaws in the tests used to evaluate the safety and effectiveness of next-generation artificial intelligence (AI) models. The study analyzed more than 440 benchmark tests and found that almost all of them have weaknesses in at least one area, which can undermine the validity of the conclusions drawn from them.

Andrew Bean, the study's lead author and a researcher at the Oxford Internet Institute, said these benchmark tests are key tools for checking whether newly released AI models are safe and aligned with human interests. However, in the absence of shared definitions and sound measurement methods, it is hard to tell whether models are genuinely improving or only appear to be.
With neither the UK nor the US having enacted nationwide AI regulation, benchmark tests have become a de facto safety net for technology companies launching new AI systems. Some companies have recently had to withdraw products or tighten restrictions after harms caused by their AI models. Google, for example, pulled its Gemma model after it generated false accusations about a U.S. senator, sparking widespread controversy.
Google stated that Gemma was built for AI developers and researchers rather than general consumers, and that it was withdrawn after the company learned non-developers were trying to use it. The study also found that most benchmarks lack uncertainty estimation or statistical tests: only 16% employed such measures. In addition, definitions of qualities such as the "harmlessness" of AI are often contested or ambiguous, further limiting the benchmarks' practical value.
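For readers unfamiliar with what the missing uncertainty estimation looks like in practice, the sketch below (not taken from the study; the per-question results are hypothetical) shows one common approach: reporting a benchmark accuracy together with a bootstrap confidence interval instead of a single point score.

```python
import random

def bootstrap_ci(per_item_scores, n_resamples=10_000, alpha=0.05, seed=0):
    """Percentile-bootstrap confidence interval for mean benchmark accuracy."""
    rng = random.Random(seed)
    n = len(per_item_scores)
    # Resample the per-question results with replacement and record each mean.
    means = sorted(
        sum(rng.choices(per_item_scores, k=n)) / n
        for _ in range(n_resamples)
    )
    lo = means[int((alpha / 2) * n_resamples)]
    hi = means[int((1 - alpha / 2) * n_resamples) - 1]
    return sum(per_item_scores) / n, (lo, hi)

# Hypothetical per-question results for one model (1 = correct, 0 = incorrect).
results = [1, 0, 1, 1, 0, 1, 1, 1, 0, 1, 1, 0, 1, 1, 1, 0, 1, 1, 0, 1]
mean, (low, high) = bootstrap_ci(results)
print(f"accuracy = {mean:.2f}, 95% CI [{low:.2f}, {high:.2f}]")
```

A wide interval like the one produced by this small sample is exactly the kind of signal the researchers say most benchmarks fail to report: without it, a one-point difference between two models cannot be distinguished from noise.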
The study calls for shared standards and best practices to strengthen the assessment of AI safety and effectiveness.
Key points:
🔍 A review of more than 440 AI safety benchmarks found that almost all have flaws that undermine the validity of their conclusions.
🚫 Google withdrew its Gemma model after it generated false accusations about a U.S. senator.
📊 Only 16% of the benchmarks used uncertainty estimates or statistical tests, underscoring the urgent need for shared standards and best practices.






