AIbase

2025-04-10 16:11:31.AIbase

ByteDance Launches Multi-SWE-bench, Setting a New Standard for Automated Multi-Language Code Repair

2025-04-10 14:35:16.AIbase

ByteDance Open-Sources Multi-SWE-bench to Drive Intelligent Upgrades for Large Model Code

2025-03-24 15:58:55.AIbase

Microsoft Unveils GeoMap-Bench to Advance Intelligent Understanding of Geological Maps

2025-03-21 11:48:03.AIbase

High School Student Creates AI Model Evaluation Website Using Minecraft

2025-01-14 10:14:07.AIbase

New AI Model LlamaV-o1 Outperforms Claude 3.5 Sonnet in Reasoning Tests

2024-12-05 14:45:53.AIbase

ByteDance's New Code Model Evaluation Benchmark 'FullStack Bench'

2024-10-12 11:38:17.AIbase

OpenAI Releases MLE-bench: A Benchmark for Evaluating AI Agents

2024-09-06 09:02:00.AIbase

DeepSeek Updates! DeepSeek V2.5 Achieves Leap in Chat Model Coding Capabilities with Comprehensive Performance Improvements

2024-08-15 14:53:25.AIbase

OpenAI Launches SWE-bench Verified: Enhancing AI Software Engineering Capability Assessment

2024-08-13 08:34:48.AIbase

'Genie', Billed as the World's Strongest AI Programmer, Debuts and Beats Devin and GPT-4

2023-08-18 10:04:45.AIbase

AI Startup Arthur Releases Open-Source AI Model Evaluation Tool 'Bench'