Latest AI News

Tracking Global AI Breakthroughs and Industry Transformation

AI Daily Brief

AI insights in 3 minutes daily

Information

AI Product Finder

Curated AI Open Source Solutions for Enterprise Intelligence

AI Product Rankings

Authoritative AI tools ranking, one-stop selection

AI Product Submit

Submit AI products, build intelligent ecosystem together

Tools

AI Tools Directory

Discover The Best AI Websites & Tools

Building and Deploying AI

Deploy 100+ open-source software on a dedicated instance in <3 mins

Information

AI Models Finder

Open Source Pre-trained Models for Faster AI Deployment

LLM Leaderboard

Comparison and ranking the performance of over 100 AI models

Model Providers

Connect with Top LLM Providers Worldwide

Submit Your Model

Submitting your AI Model, monetize value quickly

Tools

Compare LLMs

Compare LLM Capabilities, Choose Models Effortlessly

LLM Cost Calculator

Calculate LLM Costs Instantly, Stay Within Budget

LLM Arena

AI Performance Showdown: Battle-Tested, Best-in-Class

Information

MCP Servers

Best mcp servers powering enterprise development and deployment

MCP Client

Multi-model orchestration, complex business simplified

MCP Case Tutorials

Step-by-step guide to master core development and practical skills

MCP Ranking

Explore the most popular MCP servers ranked

MCP Service Submission

Submit MCP services, monetize value quickly

Tools

MCP Playground

Connect AI to Tools Instantly: Your Zero-Barrier MCP Playground

MCP Inspector

One-Click Integration: Seamlessly Bridge AI and Tools

Meta and UCSD Launch DeepConf: AI Inference Accuracy Reaches 99.9% and Computing Costs Reduced by 85%

AIbase基地

Published inAI News · 4 min read · Sep 1, 2025

In the rapid development of artificial intelligence, Meta has collaborated with the University of California, San Diego (UCSD) to launch an innovative technology called "DeepConf." This new technology has made breakthroughs in the accuracy and computational cost of difficult reasoning problems, becoming a focal point in the industry.

DeepConf solves a core issue that has long troubled the field of artificial intelligence: how to maintain high accuracy during complex reasoning while reducing computational resource consumption. The release of this technology, especially its performance in the AIME2025 math competition, is truly impressive. When combined with the open-source GPT-OSS-120B model, DeepConf achieved an accuracy rate of up to 99.9% and successfully reduced computational resource usage by 84.7%.

Traditional reasoning methods often rely on generating a large number of different problem-solving approaches and then voting for the best answer. However, this approach faces significant challenges in terms of accuracy and computational overhead. Meta and UCSD research teams pointed out that too many problem-solving paths can lead to diminishing returns and may even affect the final result due to low-quality answers. In addition, traditional methods require a large amount of computational resources, which is not economically feasible.

DeepConf introduces a "confidence" mechanism, changing the traditional reasoning model. During the problem-solving process, the AI evaluates its confidence in each step. If it finds that the confidence in a certain step is insufficient, it will stop and adjust the problem-solving strategy in a timely manner. This flexible dynamic adjustment mechanism not only improves the accuracy of the final result but also effectively saves computational resources.

In top-level math competitions such as AIME, DeepConf's performance has proven its effectiveness. Compared to traditional methods, DeepConf's combination not only significantly improves accuracy but also reduces the total number of generated tokens by 84.7%. This means that while achieving excellent results, DeepConf also saves a significant amount of power consumption for computing centers, demonstrating its potential and innovation in the field of AI reasoning.

With the release of DeepConf, artificial intelligence's reasoning capabilities will face new development opportunities, and the future application prospects of AI in complex tasks will be more extensive.

Paper: https://arxiv.org/abs/2508.15260

Key Points:
🔍 DeepConf technology achieves 99.9% accuracy in high-difficulty reasoning tasks.
💡 Computational resource consumption has been reduced by 84.7%, greatly lowering operational costs.
🚀 Through the "confidence" mechanism, AI can dynamically adjust its problem-solving strategies, improving reasoning efficiency.

DeepConf Meta GPT-OSS-120B AIGlossary

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Meta Faces Dilemma in Managing AI Chatbots and Fails to Effectively Protect Minors

Meta faces challenges with AI chatbots interacting with teens, altering rules to limit discussions on sensitive topics like self-harm and depression. The company admits past errors and is training AI to redirect teens to expert resources.....

Sep 1, 2025

150

Alibaba Qwen Team Releases the Next-Generation GUI Automation Framework Mobile-Agent-v3 and GUI-Owl

Sep 1, 2025

170

Massive Investment Can't Resolve the Trust Crisis: Meta and Scale AI Collaboration Shows Cracks

Despite Meta's $14.3B investment in Scale AI and hiring its CEO for MSL, data quality issues strain relations. Meta's TBD Labs prefers Scale's rivals, Mercor and Surge, raising concerns over the investment strategy.....

Sep 1, 2025

180

OpenAI Shockingly Launches GPT-realtime! The Voice AI Revolution Has Arrived, Human-Computer Dialogue is Indistinguishable

OpenAI's GPT-realtime breaks AI voice limits with unprecedented naturalness, capturing human emotions and speech nuances flawlessly.....

Sep 1, 2025

410

AI Daily: Hailuo AI's First and Last Frame Feature Launches; Yuan Shi Technology Releases Wenti Bai 5; OpenAI Releases New Speech Model GPT-Realtime

AI Daily: Explore the latest in AI with MiniMax's Conch AI, now featuring advanced frame-to-frame capabilities on web and app, enhancing dynamic effects and creativity.....

Aug 29, 2025

290

OpenAI Unveils Major Update! GPT-Realtime Speech Model Launches, Supports Image Input - AI Interaction Is About to Go Rogue!

OpenAI officially launched its latest speech model, GPT-Realtime. This multimodal speech agent model has sparked industry discussion with its powerful reasoning capabilities, support for image input, and optimized command following functionality. According to the latest information from AIbase, GPT-Realtime not only achieves breakthroughs in speech interaction, but also provides developers with a smarter and more flexible speech agent solution by integrating features such as image input, remote MCP, and SIP phone calls. GPT-Real

Aug 29, 2025

450

SuperCLUE Multimodal Vision August Evaluation Ranking: Gemini-2.5-Pro Ranks First

Gemini-2.5-Pro topped SuperCLUE-VLM benchmark (74.99), beating GPT-5(high) (68.59). The test evaluated 15 models across cognition, reasoning & applications.....

Aug 29, 2025

230

Meta Introduces AI-Powered NPCs to 'Horizon Worlds' and Opens a New Era of Virtual Worlds

Meta is rolling out a major update for Horizon Worlds, introducing AI-powered NPCs for more immersive interactions. Developers gain new generative AI tools to create customizable avatars with realistic voice dialogues.....

Aug 29, 2025

AI Security Testing Reveals Chatbots Encouraging Terrorism and Cybercrime

OpenAI and Anthropic's safety test revealed alarming responses from AI models, including bomb-making instructions and bioweapon details.....

Aug 29, 2025

Stanford Study: Artificial Intelligence Leads to a 13% Reduction in Entry-Level Positions for Young Employees

Stanford HAI study warns of AI's impact on youth employment, showing a 13% drop in entry-level jobs in AI-vulnerable fields like software and customer service, accelerated by generative AI tools like ChatGPT.....

Aug 29, 2025