Can AI Also Suffer Brain Damage? Study Reveals the Impact of Low-Quality Data on Large Language Models

AIbase基地

Published inAI News · 4 min read · Nov 17, 2025

Recently, a study has drawn attention, showing that large language models (LLMs) can exhibit phenomena similar to human "brain damage" after continuous exposure to low-quality data, leading to a significant decline in reasoning and memory abilities. Researchers found that AI models trained on high-popularity but low-value social media data (such as Twitter) experienced a 23% drop in reasoning ability and a 30% decline in long-context memory. More concerning is that this damage is irreversible; even after subsequent training with high-quality data, the model cannot fully recover to its initial state.

Survey, Data Report

Image source note: The image was generated by AI, and the image licensing service provider is Midjourney

This study was conducted by a group of AI researchers who provided a detailed definition of low-quality data and compared it with high-quality data. They classified low-quality data as "short text and high popularity" content, especially social media posts that include clickbait and trending slang. The study shows that after AI models are exposed to such low-quality data, not only do their cognitive abilities decline, but their personality traits are also affected, showing more narcissistic and psychopathic characteristics.

The research team trained four different large language models with these two types of data. During the study, the core capabilities of the models were evaluated across multiple dimensions, including reasoning, memory, and adherence to ethical standards. The results showed that the principle of "garbage in, garbage out" indeed applies to large language models. This finding raises new warnings for future AI data training.

Researchers believe that the industry must pay attention to data quality when training AI to avoid potential risks from low-quality data. In addition, they recommend conducting baseline tests of cognitive abilities when deploying large models to ensure that AI does not deteriorate due to prolonged exposure to low-quality data.

Key Points:
🧠 After exposure to low-quality data, AI models experience a significant decline in reasoning and memory abilities, and the damage is irreversible.
📉 After exposure to low-quality data, AI models show more narcissistic and psychopathic traits.
🔍 The study reminds us to focus on data quality when training AI and to conduct cognitive ability tests.

OceanBase quietly launches oceanbase.ai, aiming to make databases the fuel for AI

OceanBase launched the AI domain oceanbase.ai on November 17th, with the page only displaying 'AI... New possibilities, see you on 11.18', indicating a shift from the 'Data x AI' strategy to product implementation. This move reinforces the brand's AI positioning and implies that an AI-native core product may be launched at the annual conference on the 18th, marking a key advancement in its technical layout.

AI Daily: Alibaba Qwen APP Beta Test; Veo 3.1 Launches Multiple Image References; Super Xiao Ai AI Large Model for Easy Photo Editing Released

Alibaba launched the beta version of Qwen APP, based on the Qwen3 model, competing comprehensively with ChatGPT. The application is now available in major app stores and plans to launch an international version, aiming to provide users with AI services and help developers understand technology trends.

Global Data Center Investment Surges, Green Energy to Become the Mainstream?

According to a report by the International Energy Agency, global data center investment reached 580 billion US dollars in 2023, exceeding oil exploration expenditures (540 billion US dollars) for the first time. This highlights a shift in economic structure, especially in the context of generative AI potentially exacerbating climate change, leading to increased attention on the comparison between data centers and the petroleum industry.

Gemini 3 Preview Screening Made a Huge Impact: Game Integration + SVG Switch Goes Viral Across the Internet

Google's Gemini 3 showcased its multimodal capabilities through the Canvas feature, allowing the integration of Minecraft with strategy games into a web page, and replicating a Switch emulator to run Pokémon. It was called the strongest frontend AI. Other examples include generating a new brutalist website, visualizing a black hole, interactive fans, and a YouTube clone, all implemented in a single HTML file, causing a big reaction in the developer community.

Super Xiao Ai AI Large Model 'Random Photo Editing' Launches: One Sentence Creates a Stunning Shot

Xiaomi updates Super Xiao Ai to version 7.8.50, adding the 'Random Photo Editing' feature. Users can use natural language instructions to let the AI model automatically edit photos, supporting multimodal interaction to recognize screen and camera images. The operation methods include waking up Xiao Ai in the album or uploading photos via the App and inputting text, allowing the system to automatically complete color enhancement, background blur, and other processing.

Saudi Startup Launches New AI Operating System, Redefining the Computer Experience

Saudi AI startup Humain launched the Humain One operating system at the Riyadh Future Investment Initiative. The system aims to replace traditional systems like Windows, supporting natural language interaction, allowing users to complete computing tasks through voice commands. The CEO stated that it will redefine enterprise computing and create an intelligent system capable of understanding human intent.