Best 2026 Test AI Tools & Models - Premium 2026 Test News

AI News

xAI releases Grok4.20: Significant improvement in reasoning performance, 78% non-hallucination rate sets industry record

On March 12, 2026, xAI released its new large language model, Grok4.20 Beta. The model sets a new industry record for factual reliability while keeping its cost advantage. On the Intelligence Index evaluation with reasoning enabled, Grok4.20 scored 48 points, 6 points higher than its predecessor. Its overall benchmark score (57 points) is still slightly below those of Gemini 3.1 Pro Preview and GPT-5.4, but it performed strongly on the AA-Omniscience test, with a non-hallucination rate of 78%.

10.6k 14 minutes ago

Mobile "Shrimp Farming" War Heats Up: Alibaba Cloud Launches JVSClaw, a Mobile Version of the OpenClaw "Lobster"

In March 2026, Alibaba Cloud's mobile AI application "JVSClaw" was officially launched, offering free beta testing and model credits. Tencent's WorkBuddy also updated its WeChat direct connection feature, indicating that cloud providers are fiercely competing for the mobile AI entry point.

11.6k 4 minutes ago

Tencent Cloud's Billing Strategy Undergoes Major Adjustment: Prices of Some AI Models Rise Significantly

Starting March 13, 2026, the Tencent Cloud Intelligent Agent Development Platform will adjust its AI model billing strategy. The key changes are ending the free trial of models in public testing and revising the pricing of the self-developed Hunyuan series models. Three high-performance models, GLM5, MiniMax2.5, and Kimi2.5, will end their limited-time free public testing. The move marks a maturing stage in Tencent Cloud's AI commercial ecosystem.

10.7k 2 hours ago

Affected by Tesla's AI6 Chip Production Plan Changes, South Korean AI Rising Star DX-M2 Mass Production Delayed to Third Quarter of 2026

Changes to Tesla's production plan led Samsung to adjust its 2nm production line schedule, forcing Korean AI chip firm DeepX to delay mass production of its next-generation NPU chip, the DX-M2, by six months, with testing expected only after Q3 2026. This highlights how semiconductor foundries prioritize large clients when scheduling, which affects smaller enterprises...

10.6k 2 hours ago

Models

| Model | Provider | Input tokens/M | Output tokens/M | Context Length |
| --- | --- | --- | --- | --- |
| Gemini 2.0 Flash-Lite | Google | $0.49 | $2.1 | - |
| GPT-4.1 mini | OpenAI | $2.8 | $11.2 | - |
| Grok 4 Fast | xAI | $1.4 | $3.5 | - |
| o3-mini | OpenAI | $7.7 | $30.8 | 200 |
| GPT-5 Codex | OpenAI | - | - | - |
| Claude 3 Opus | Anthropic | $105 | $525 | 200 |
| Gemini 2.0 Flash | Google | $0.7 | $2.8 | - |
| Claude Haiku 4.5 | Anthropic | - | $35 | 200 |
| Gemini 2.5 Flash | Google | $2.1 | $17.5 | - |
| Claude Sonnet 4.5 | Anthropic | $21 | $105 | 200 |
| Claude 3 Sonnet | Anthropic | $21 | $105 | 200 |
| Gemini 2.5 Flash-Lite | Google | $0.7 | $2.8 | - |
| qwen3-coder-plus | Alibaba | - | $16 | - |
| Qianfan-Lightning | Baidu | - | - | 128 |
| qwen3-max | Alibaba | - | $24 | 256 |
| qwen3-vl-plus | Alibaba | - | $10 | 256 |
| qwen-image-plus | Alibaba | - | - | - |
| qwen-image-edit | Alibaba | - | - | - |
| Doubao-Seed-Translation | ByteDance | $1.2 | $3.6 | - |
| qwen3-livetranslate-flash-realtime-2025-09-22 | Alibaba | - | $240 | - |
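As a rough illustration of how the per-million-token prices listed above translate into a bill, the sketch below estimates the cost of a single request. The function name `estimate_cost` and the token counts are hypothetical and chosen only for this example; the two prices are the Gemini 2.0 Flash-Lite figures from the listing, and the calculation assumes simple linear per-token billing with no discounts, caching, or minimum charges.

```python
def estimate_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Estimate the cost of one request from per-million-token prices.

    Assumes plain linear billing: tokens / 1,000,000 * price per million.
    """
    input_cost = input_tokens / 1_000_000 * input_price_per_m
    output_cost = output_tokens / 1_000_000 * output_price_per_m
    return input_cost + output_cost


# Hypothetical workload: 200,000 input tokens and 50,000 output tokens,
# priced with the Gemini 2.0 Flash-Lite figures from the listing.
cost = estimate_cost(200_000, 50_000, input_price_per_m=0.49, output_price_per_m=2.1)
print(f"Estimated cost: ${cost:.4f}")
```

Because output tokens are priced several times higher than input tokens for most models in the listing, output length tends to dominate the estimate for generation-heavy workloads.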

Empowering the future, your artificial intelligence solution think tank
