AIBase
Home
AI NEWS
AI Tools
AI Models
MCP
AI Services
AI Compute
AI Tutorial
Datasets
EN

AI News

View More

Leading AI Models Perform Poorly in Complex Physical Tasks and Still Require Human Assistance

Physicists created 'CritPt' to test AI on complex physics problems. Gemini 3 Pro scored only 9.1%, showing AI's limits in advanced research.....

4k 22 minutes ago
Leading AI Models Perform Poorly in Complex Physical Tasks and Still Require Human Assistance

How Far is AI from the Nobel Prize? Top Models Fail in the CritPt Benchmark for Doctoral-Level Physics with Accuracy Below 10%

CritPt benchmark reveals top AI models like Gemini3Pro and GPT-5 still far from autonomous scientists, testing doctoral-level research skills over memorization.....

6.4k 2 minutes ago
How Far is AI from the Nobel Prize? Top Models Fail in the CritPt Benchmark for Doctoral-Level Physics with Accuracy Below 10%
AIBase
Empowering the future, your artificial intelligence solution think tank
English简体中文繁體中文にほんご
FirendLinks:
AI Newsletters AI ToolsMCP ServersAI NewsAIBaseLLM LeaderboardAI Ranking
© 2025AIBase
Business CooperationSite Map