AI Model's Numerical Comparison Errors Spark Discussion; Moon's Dark Side Responds: Helps Understand Capability Limits
Recently, several artificial intelligence large models have garnered widespread attention for making errors in simple numerical comparisons. Prominent AI models, including ByteBean, GPT4o, Kimi from the Dark Side of the Moon, StepStar JumpAsk, and Baichuan Intelligence's BaiXiaoYing, all provided incorrect answers to basic questions like "Which is larger, 9.11 or 9.9?" Additionally, earlier reports indicated that multiple large models incorrectly answered how many "r"s are in the word "strawberr