In the rapid development of artificial intelligence, Meta has collaborated with the University of California, San Diego (UCSD) to launch an innovative technology called "DeepConf." This new technology has made breakthroughs in the accuracy and computational cost of difficult reasoning problems, becoming a focal point in the industry.

image.png

DeepConf solves a core issue that has long troubled the field of artificial intelligence: how to maintain high accuracy during complex reasoning while reducing computational resource consumption. The release of this technology, especially its performance in the AIME2025 math competition, is truly impressive. When combined with the open-source GPT-OSS-120B model, DeepConf achieved an accuracy rate of up to 99.9% and successfully reduced computational resource usage by 84.7%.

Traditional reasoning methods often rely on generating a large number of different problem-solving approaches and then voting for the best answer. However, this approach faces significant challenges in terms of accuracy and computational overhead. Meta and UCSD research teams pointed out that too many problem-solving paths can lead to diminishing returns and may even affect the final result due to low-quality answers. In addition, traditional methods require a large amount of computational resources, which is not economically feasible.

image.png

DeepConf introduces a "confidence" mechanism, changing the traditional reasoning model. During the problem-solving process, the AI evaluates its confidence in each step. If it finds that the confidence in a certain step is insufficient, it will stop and adjust the problem-solving strategy in a timely manner. This flexible dynamic adjustment mechanism not only improves the accuracy of the final result but also effectively saves computational resources.

In top-level math competitions such as AIME, DeepConf's performance has proven its effectiveness. Compared to traditional methods, DeepConf's combination not only significantly improves accuracy but also reduces the total number of generated tokens by 84.7%. This means that while achieving excellent results, DeepConf also saves a significant amount of power consumption for computing centers, demonstrating its potential and innovation in the field of AI reasoning.

With the release of DeepConf, artificial intelligence's reasoning capabilities will face new development opportunities, and the future application prospects of AI in complex tasks will be more extensive.

Paper: https://arxiv.org/abs/2508.15260

Key Points:   

🔍 DeepConf technology achieves 99.9% accuracy in high-difficulty reasoning tasks.   

💡 Computational resource consumption has been reduced by 84.7%, greatly lowering operational costs.   

🚀 Through the "confidence" mechanism, AI can dynamically adjust its problem-solving strategies, improving reasoning efficiency.