xAI releases Grok 4.20: Significant improvement in reasoning performance, 78% non-hallucination rate sets industry record
On March 12, 2026, xAI released its new large language model, Grok 4.20 Beta. The model sets a new industry record for factual reliability while keeping its cost advantage. On the Intelligence Index evaluation with reasoning enabled, Grok 4.20 scored 48 points, 6 points higher than its predecessor. Although its overall benchmark score (57 points) is still slightly lower than those of Gemini 3.1 Pro Preview and GPT-5.4, it performed outstandingly on the AA-Omniscience test, with a non-hallucination rate of 78%.