Information

Latest AI News

Explore AI Frontiers, Master Industry Trends

AI Daily Brief

Your Daily AI Brief - Never Miss What's Next

Information

AI Product Finder

Smart Product Discovery - Comprehensive Market Intelligence

AI Product Rankings

AI Product Power Rankings - Performance, Buzz & Trends

AI Product Submit

Submit Your AI Product - Amplify Reach & Drive Growth

Tools

AI Tools Directory

Discover The Best AI Websites & Tools

Information

AI Models Finder

Comprehensive AI Models Collection for All Your Development & Research Needs

LLM Leaderboard

AI LLM Power Rankings - Performance, Buzz & Trends

Model Providers

Discover Trusted AI Model Partners - Guaranteed Reliable Support

Submit Your Model

Submit Your Model Info & Services - Precision Marketing & User Targeting

Tools

Compare LLMs

Multi-Dimensional Large Model Comparison - Find Your Perfect Match

LLM Cost Calculator

Calculate AI Model Costs Accurately - Optimize Your Budget

LLM Arena

Multi-Model Real-Time Evaluation & Quick Output Comparison

Information

MCP Servers

Discover Popular AI-MCP Services - Find Your Perfect Match Instantly

MCP Client

Easy MCP Client Integration - Access Powerful AI Capabilities

MCP Case Tutorials

Master MCP Usage - From Beginner to Expert

MCP Ranking

Top MCP Service Performance Rankings - Find Your Best Choice

MCP Service Submission

Publish & Promote Your MCP Services

Tools

MCP Playground

Test MCP Services Freely - Quick Online Experience

MCP Inspector

Quick MCP Service Testing - Fast Deployment

GEO Services

Achieve Dominant Visibility in AI Search for Your Business or Brand with GEO Services

AI Search Visibility Checker

Detect brand's visibility on AI platforms

Tools

AI Model Compatibility Checker

Free PC Hardware Test for DeepSeek & Llama

Information

AI Dataset Collection

Large-scale datasets and benchmarks for training, evaluating, and testing models to measure

Tools

Intelligent Document Recognition

Comprehensive Text Extraction and Document Processing Solutions for Users

AI Tutorial

Research Reveals that a Large Amount of Garbage Data Affects the Reasoning Ability of Large Language Models

AIbase基地

Published inAI News · 5 min read · Oct 27, 2025

According to a new study, large language models (LLMs) may experience a significant drop in performance after prolonged exposure to meaningless online content. The study shows that these models' reasoning abilities and confidence are affected, raising concerns about their long-term health. The research team, from multiple universities in the United States, proposed the "LLM Brain Degeneration Hypothesis," drawing inspiration from the cognitive damage that humans may suffer from excessive exposure to low-quality online content.

AI Tutor Robot

Image source note: The image was generated by AI, and the image licensing service is Midjourney.

To validate this theory, researchers conducted controlled experiments using Twitter data from 2010. They trained four smaller models, including Llama3-8B-Instruct and Qwen series models, comparing different proportions of "junk" data with high-quality control data.

The researchers defined "junk" data in two ways. The first method (M1) filtered based on interaction volume, considering posts shorter than 30 characters with high interaction (over 500 likes, retweets, or comments) as junk content, while longer posts (over 100 characters) with low interaction were considered control content. The second method (M2) used GPT-4o-mini to rank content based on quality, labeling conspiracy theories, exaggerated statements, and attention-grabbing headlines as junk content, while more thoughtful material was considered high-quality content.

The study found that as the proportion of junk data increased, the model's performance in reasoning accuracy dropped sharply. For example, in the ARC challenge benchmark test, reasoning accuracy dropped from 74.9% to 57.2%. For tasks requiring long-text understanding, accuracy even fell from 84.4% to 52.3%. The definition of junk content based on interaction volume had a more pronounced impact on the model, showing that interaction volume introduced a different dimension of data quality compared to standard semantic checks.

Additionally, after exposure to a large amount of interaction-driven junk content, the models exhibited some "dark" personality traits, including higher narcissism and manipulative tendencies. Safety metrics also declined, although exposure to low-quality junk content sometimes increased certain positive characteristics.

Error analysis showed that "jumping thoughts" was the most common issue, with over 70% of errors involving no reasoning at all, especially when exposed to interaction-based junk content, where the jump rate reached 84%. When performing logical reasoning chains, the models often failed to complete the reasoning steps, leading to basic errors.

The research team called for a re-evaluation of how large language models collect and filter online data, stating that data selection and quality control are crucial for preventing permanent degradation, and they recommended regular "cognitive health check-ups" for deployed models.

Key Points:
🌐 ** Decline in Model Performance **: As the proportion of junk data increases, reasoning accuracy drops significantly, with the highest decline reaching 17.7%.
🧠 ** Jumping Thoughts Issue **: The study found that models frequently skip logical steps during reasoning, severely affecting their reasoning ability.
🔍 ** Data Quality Control **: The study suggests emphasizing data selection and quality control to prevent long-term performance degradation of large language models.

LargeLanguageModel LLMBrainRotHypothesis Llama3-8B-Instruct Qwen

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Alibaba Qwen Launches Deep Research: Generate Reports, Web Pages, and Podcasts with One Click

Alibaba upgraded Qwen Deep Research, enabling one-click generation of cited reports, interactive webpages, and multi-speaker podcasts in Qwen Chat, completing the data-to-content workflow with minimal clicks.....

Oct 23, 2025

170

ByteDance Seed Team Announces the Launch of 3D Generation Large Model Seed 3D 1.0

The ByteDance Seed team recently announced the launch of the 3D generation large model Seed3D1.0, which is capable of generating high-quality, realistic 3D models from a single image in an end-to-end manner, including detailed geometry, realistic textures, and physically based rendering (PBR) materials. This innovative achievement is expected to provide powerful world simulation support for the development of embodied intelligence, addressing bottlenecks in physical interaction capabilities and content diversity in current technologies. During the development process, the Seed team collected and processed a large amount of high-quality 3D data, building a complete three

Oct 23, 2025

550

Chesky: Airbnb Temporarily Pauses Integration with ChatGPT; AI Customer Service Already Uses Qwen

Airbnb CEO Brian Chesky stated the company has not integrated ChatGPT due to immature connection tools and platform stability needs. Emphasizing reliance on identity verification, Airbnb will monitor ChatGPT's progress and may collaborate in the future.....

Oct 23, 2025

180

Cybercab to Start Production in the Second Quarter of Next Year; Optimus V3 to Be Unveiled: Musk Places Bet on AI and Robotics

Tesla's Q3 2025 revenue hit a record $28.1B, up 11.57% YoY, but net profit fell 36.81% to $1.37B. CEO Musk shifts focus to AI and robotics, declaring Tesla at a 'key turning point in real-world AI'.....

Oct 23, 2025

190

Hailuo 2.3 is Coming Soon: The Next-Generation AI Video Model That Exceeds Veo, with Enhanced Realism

MiniMax's Hailuo2.3 video generation model achieves breakthroughs in realism, precision, and style diversity, enhancing motion capture to solidify its industry leadership after surpassing Google Veo3.....

Oct 23, 2025

1.3k

Alibaba's C Plan Debut: Quark Dialogue Assistant Launches, Using Qwen Model to Capture the C-End AI Access Point

Alibaba's 'Project C' launches Quark App's AI assistant using the latest Tongyi Qianwen model, targeting young users to enhance its C-end ecosystem. Accessible via homepage tap or swipe.....

Oct 23, 2025

290

Hunyuan World Model 1.1 Officially Released: Revolutionary 3D Reconstruction Technology, High-Quality Scene Generation in Seconds

Tencent's open-source Hunyuan World Model 1.1 supports multi-view and video inputs, enables single-GPU deployment, and accelerates generation. It creates professional 3D scenes from videos or images in seconds, making advanced 3D reconstruction accessible to general users.....

Oct 22, 2025

710

AI Daily: OpenAI Releases Browser Atlas; Tongyi Qwen3-VL Adds Two Model Sizes, 2B and 32B; Baidu Launches Recurrent Evidence Enhancement Large Model

OpenAI launches ChatGPT Atlas browser with integrated AI assistant, challenging Chrome. Features Agent mode for smart interactions per tab, expanding from chat tool to internet platform.....

Oct 22, 2025

170

Aliyun Qwen3-VL Adds Two Model Sizes: 2B and 32B, Easily Run on Mobile Devices

Qwen3-VL adds 2B and 32B dense models for lightweight to high-performance vision-language tasks, supporting mobile devices. Instruct models offer fast, stable responses for dialogues and tools, while Thinking models focus on reasoning, enhancing development ease and flexibility.....

Oct 22, 2025

230

Qwen3-VL Family Adds 2B and 32B Models! Open Source Matrix Gets a Major Upgrade

Alibaba Cloud launches two new Qwen3-VL models (2B and 32B), expanding the series to 24 open-source models with a comprehensive tech matrix from lightweight to large-scale.....

Oct 22, 2025

490

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

GEO Services​

AI Search Visibility Checker

AI Model Compatibility Checker

AI Dataset Collection

Intelligent Document Recognition

Research Reveals that a Large Amount of Garbage Data Affects the Reasoning Ability of Large Language Models

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Alibaba Qwen Launches Deep Research: Generate Reports, Web Pages, and Podcasts with One Click

ByteDance Seed Team Announces the Launch of 3D Generation Large Model Seed 3D 1.0

Chesky: Airbnb Temporarily Pauses Integration with ChatGPT; AI Customer Service Already Uses Qwen

Cybercab to Start Production in the Second Quarter of Next Year; Optimus V3 to Be Unveiled: Musk Places Bet on AI and Robotics

Hailuo 2.3 is Coming Soon: The Next-Generation AI Video Model That Exceeds Veo, with Enhanced Realism

Alibaba's C Plan Debut: Quark Dialogue Assistant Launches, Using Qwen Model to Capture the C-End AI Access Point

Hunyuan World Model 1.1 Officially Released: Revolutionary 3D Reconstruction Technology, High-Quality Scene Generation in Seconds

AI Daily: OpenAI Releases Browser Atlas; Tongyi Qwen3-VL Adds Two Model Sizes, 2B and 32B; Baidu Launches Recurrent Evidence Enhancement Large Model

Aliyun Qwen3-VL Adds Two Model Sizes: 2B and 32B, Easily Run on Mobile Devices

Qwen3-VL Family Adds 2B and 32B Models! Open Source Matrix Gets a Major Upgrade

GEO Services