Best CritPt AI Tools & Models - Premium CritPt News

AI News

Leading AI Models Perform Poorly in Complex Physical Tasks and Still Require Human Assistance

Physicists created 'CritPt' to test AI on complex physics problems. Gemini 3 Pro scored only 9.1%, showing AI's limits in advanced research.....

13.1k 21 hours ago

Leading AI Models Perform Poorly in Complex Physical Tasks and Still Require Human Assistance

How Far is AI from the Nobel Prize? Top Models Fail in the CritPt Benchmark for Doctoral-Level Physics with Accuracy Below 10%

CritPt benchmark reveals top AI models like Gemini3Pro and GPT-5 still far from autonomous scientists, testing doctoral-level research skills over memorization.....

14.2k 2 days ago

Models

GPT-5 Codex

Openai

Input tokens/M

Output tokens/M

Context Length

Empowering the future, your artificial intelligence solution think tank

English 简体中文繁體中文にほんご

FirendLinks:

AI Newsletters AI Tools MCP Servers AI News AIBase LLM Leaderboard AI Ranking

Business Cooperation Site Map