Best Multimodal Reasoning Model AI Tools & Models - Premium Multimodal Reasoning Model News

AI News

OpenRouter Launches Anonymous Models Hunter Alpha and Healer Alpha: Up to 1T Parameters, Support for Multimodal Input

The OpenRouter platform has added two new models, Hunter Alpha and Healer Alpha. Hunter Alpha has up to 1 trillion parameters, supports a 1 million token context, and multimodal input, designed for agent scenarios, excelling in complex reasoning and multi-step tasks. Healer Alpha has a context window of 262K tokens. Both models have attracted community attention.

30.5k 5 minutes ago

OpenRouter Launches Anonymous Models Hunter Alpha and Healer Alpha: Up to 1T Parameters, Support for Multimodal Input

Capable of Deciding When to Think on Its Own! Microsoft Releases Phi-4 15B Open-Source Model, Focused on Miniaturization and Multimodal Capabilities

Microsoft releases the open-source multimodal large model Phi-4-reasoning-vision-15B, which has 15 billion parameters. Its core breakthrough is the ability to autonomously assess task difficulty and intelligently choose between rapid response or in-depth reasoning, a rare feature in lightweight open-source models. The model specializes in high-difficulty tasks such as image description, interface element localization, and complex mathematical reasoning.

10.4k 4 days ago

DeepSeek V4 to be released next week: Native support for audio, video, image, and text generation, compatible with domestic computing power

DeepSeek will launch the multimodal model V4 next week, supporting image, video, and text generation, targeting the high-performance, low-cost open-source market in China. This follows the R1 reasoning model release in January. Initial technical notes will be provided, with a detailed engineering report in a month. V4 has established foundational collaborations with Huawei and Cambricon.....

43.9k 22 hours ago

DeepSeek V4 to be released next week: Native support for audio, video, image, and text generation, compatible with domestic computing power

Doubao Large Model 2.0 Officially Released, Inference Cost Reduced by an Order of Magnitude, API Now Opened

Volcano Engine launches Doubao Model 2.0, offering API services for enterprises and developers, with personal access via designated platforms. Optimized for production, it features efficient reasoning, multimodal understanding, and complex instruction execution, enhancing real-world task handling while significantly reducing costs and boosting daily usage.....

19.9k 1 days ago

Doubao Large Model 2.0 Officially Released, Inference Cost Reduced by an Order of Magnitude, API Now Opened

AI Products

Grok 4

Grok 4 is a revolutionary AI model launched by xAI, featuring advanced reasoning capabilities, multimodal functions, and professional coding features.

AI model

13.2k

Step-R1-V-Mini

A new multimodal reasoning model that supports image and text input, text output, and has high-precision image perception and complex reasoning capabilities.

AI model

9.9k

Grok 3

The latest flagship AI model from xAI, Grok 3, boasts powerful reasoning and multimodal processing capabilities.

AI model

17k

Kimi k1.5

Kimi k1.5 is a multimodal language model enhanced by reinforcement learning, focused on improving reasoning and logical abilities.

Model training and deployment

26.4k

Models

Gemini 2.0 Flash-Lite

Google

$0.49

Input tokens/M

$2.1

Output tokens/M

Context Length

GPT-4.1 mini

Openai

$2.8

Input tokens/M

$11.2

Output tokens/M

Context Length

Grok 4 Fast

Xai

$1.4

Input tokens/M

$3.5

Output tokens/M

Context Length

GPT-5 Codex

Openai

Input tokens/M

Output tokens/M

Context Length

Claude 3 Opus

Anthropic

$105

Input tokens/M

$525

Output tokens/M

200

Context Length

Gemini 2.0 Flash

Google

$0.7

Input tokens/M

$2.8

Output tokens/M

Context Length

Claude Haiku 4.5

Anthropic

Input tokens/M

$35

Output tokens/M

200

Context Length

Gemini 2.5 Flash

Google

$2.1

Input tokens/M

$17.5

Output tokens/M

Context Length

Claude Sonnet 4.5

Anthropic

$21

Input tokens/M

$105

Output tokens/M

200

Context Length

Gemini 2.5 Flash-Lite

Google

$0.7

Input tokens/M

$2.8

Output tokens/M

Context Length

qwen3-vl-235b-a22b-thinking

Alibaba

Input tokens/M

$20

Output tokens/M

Context Length

qwen3-coder-plus

Alibaba

Input tokens/M

$16

Output tokens/M

Context Length

qwen3-max

Alibaba

Input tokens/M

$24

Output tokens/M

256

Context Length

qwen3-vl-plus

Alibaba

Input tokens/M

$10

Output tokens/M

256

Context Length

qwen-image-edit

Alibaba

Input tokens/M

Output tokens/M

Context Length

qwen3-livetranslate-flaltimeash-re-2025-09-22

Alibaba

Input tokens/M

$240

Output tokens/M

Context Length

wan2.5-i2v-preview

Alibaba

Input tokens/M

Output tokens/M

Context Length

wan2.5-t2i-preview

Alibaba

Input tokens/M

Output tokens/M

Context Length

wan2.5-t2v-preview

Alibaba

Input tokens/M

Output tokens/M

Context Length

qwen3-omni-flash-realtime

Alibaba

$3.9

Input tokens/M

$15.2

Output tokens/M

Context Length

Empowering the future, your artificial intelligence solution think tank

English 简体中文繁體中文にほんご

FirendLinks:

AI Newsletters AI Tools MCP Servers AI News AIBase LLM Leaderboard AI Ranking

Business Cooperation Site Map

AI News

OpenRouter Launches Anonymous Models Hunter Alpha and Healer Alpha: Up to 1T Parameters, Support for Multimodal Input

Capable of Deciding When to Think on Its Own! Microsoft Releases Phi-4 15B Open-Source Model, Focused on Miniaturization and Multimodal Capabilities

DeepSeek V4 to be released next week: Native support for audio, video, image, and text generation, compatible with domestic computing power

Doubao Large Model 2.0 Officially Released, Inference Cost Reduced by an Order of Magnitude, API Now Opened

AI Products

Grok 4

Step-R1-V-Mini

Grok 3

Kimi k1.5

Models

Gemini 2.0 Flash-Lite

GPT-4.1 mini

Grok 4 Fast

GPT-5 Codex

Claude 3 Opus

Gemini 2.0 Flash

Claude Haiku 4.5

Gemini 2.5 Flash

Claude Sonnet 4.5

Gemini 2.5 Flash-Lite

qwen3-vl-235b-a22b-thinking

qwen3-coder-plus

qwen3-max

qwen3-vl-plus

qwen-image-edit

qwen3-livetranslate-flaltimeash-re-2025-09-22

wan2.5-i2v-preview

wan2.5-t2i-preview

wan2.5-t2v-preview

qwen3-omni-flash-realtime

OpenMMReasoner RL

ERNIE 4.5 VL 28B A3B Thinking AWQ 8bit

Qwen3 VL 12B Thinking Brainstorm20x NEO MAX GGUF

Qwen3 VL 2B Thinking GGUF

Qwen3 VL 8B Thinking GGUF

Qwen3 VL 4B Instruct GGUF

Qwen3 VL 30B A3B Instruct GGUF

Qwen3 VL 4B Instruct GGUF

Qwen_Qwen3 VL 2B Thinking GGUF

Qwen3 VL 2B Instruct GGUF

Qwen3 VL 30B A3B Instruct GGUF

LFM2 VL 3B

Qwen3 VL 32B Thinking 4bit

NVIDIA Nemotron Nano 12B V2 VL BF16

Qwen3 VL 2B Instruct

Qwen3 VL 30B A3B Instruct AWQ

Qwen3vl 8B Thinking 4bit Mlx

Qwen3 VL 4B Instruct NPU

Qwen3 Omni 30B A3B Thinking GGUF Q4_K_S

Bee 8B RL