AIBase
Home
AI NEWS
AI Tools
AI Models
MCP
AI Services
AI Compute
AI Tutorial
EN

AI News

View More

New Benchmark for AI Evaluation! GPT-5 and Other Cutting-Edge Models Score Zero Points. What Is the Level of Doctor-Level Reasoning?

FormulaOne AI benchmark draws attention as top models like GPT-5 and Grok4 score zero. Developed by AAI, it includes 220 graph-based dynamic programming problems across complex fields like topology and combinatorics, ranging from medium to research-level difficulty.....

8.6k yesterday
New Benchmark for AI Evaluation! GPT-5 and Other Cutting-Edge Models Score Zero Points. What Is the Level of Doctor-Level Reasoning?
AIBase
Empowering the future, your artificial intelligence solution think tank
English简体中文繁體中文にほんご
FirendLinks:
AI Newsletters AI ToolsMCP ServersAI NewsAIBaseLLM LeaderboardAI Ranking
© 2025AIBase
Business CooperationSite Map