Best small language models AI Tools & Models - Premium small language models News

AI News

AI Black Box Awakening: Google AI Learns Language on Its Own, Where Is the Boundary of Human Intelligence Control?

Google's CEO admitted that the company does not have full control over how its AI systems operate, highlighting the AI black-box problem. Large language models trained on massive amounts of data exhibit emergent behaviors: Google's PaLM model, for example, can handle Bengali translation after seeing only a small amount of Bengali data, reflecting a leap from explicit training to self-directed learning.

14.9k 6 hours ago

Dell GB10: Desktop Supercomputing Leads a New Era of Local AI

As small to medium language models improve, AI developers question the need for costly cloud computing. Local computing struggles with memory limits for 3B or 7B parameter models, keeping development reliant on remote infrastructure...

11.6k yesterday

Mistral Releases Devstral2 Open-Source Programming Model: 123 Billion Parameters, Cost Only 1/7 of Claude Sonnet

Mistral AI has launched the open-source coding models Devstral2 (123B) and Devstral Small2 (24B). The flagship achieves 72.2% on SWE-Bench, setting a new open-source record, and Mistral claims 7x the cost efficiency of Claude Sonnet. The company has also open-sourced Mistral Vibe, a CLI tool for batch code editing via natural language. Both models are available via API, with Devstral2 priced at $0.40 per million input tokens and the lightweight version free...

10.1k 8 hours ago

MIT-based Startup Liquid AI Unveils Enterprise-Level Small Model Training Blueprint LFM2

Liquid AI released the second generation of its Liquid Foundation Models (LFM2) in July 2025. Built on an innovative "liquid" architecture, LFM2 aims to be the fastest on-device foundation model on the market. Its efficient training and inference allow small models to rival cloud-hosted large language models. LFM2 initially ships as dense checkpoints with 350M, 700M, and 1.2B parameters.

9.7k 6 days ago

AI Products

Radal

Radal is a no-code platform that allows you to fine-tune small language models using your own data. Connect your datasets, configure training visually, and deploy models in minutes.

Model training and deployment

6.5k

rStar-Math

Showcases research results demonstrating how small language models can master mathematical reasoning abilities through self-evolution and deep thinking.

Model training and deployment

8.7k

Phi Open Models

Phi Open Models are powerful, cost-effective, low-latency small language models.

AI model

8.4k

SLM_Survey

Research, Measurement, and Insights on Small Language Models

AI academic research

9.2k

Models

| Model | Provider | Input ($ per M tokens) | Output ($ per M tokens) | Context Length |
| --- | --- | --- | --- | --- |
| GPT-4.1 mini | OpenAI | $2.8 | $11.2 | - |
| Grok 4 Fast | xAI | $1.4 | $3.5 | - |
| o3-mini | OpenAI | $7.7 | $30.8 | 200 |
| Claude Haiku 4.5 | Anthropic | - | $35 | 200 |
| Claude 3 Sonnet | Anthropic | $21 | $105 | 200 |
| qwen3-vl-235b-a22b-thinking | Alibaba | - | $20 | - |
| qwen3-coder-plus | Alibaba | - | $16 | - |
| qwen3-vl-plus | Alibaba | - | $10 | 256 |
| qwen3-max | Alibaba | - | $24 | 256 |
| Doubao-Seed-Translation | ByteDance | $1.2 | $3.6 | - |
| qwen3-livetranslate-flash-realtime-2025-09-22 | Alibaba | - | $240 | - |
| wan2.5-i2v-preview | Alibaba | - | - | - |
| qwen3-omni-flash-realtime | Alibaba | $3.9 | $15.2 | - |
| qwen3-tts-flash | Alibaba | - | - | - |
| qwen3-tts-flash-realtime | Alibaba | - | - | - |
| Kimi-K2 | Moonshot | - | $16 | 256 |
| Doubao-Seedream-3.0-t2i | ByteDance | - | - | - |
| Doubao-SeedEdit-3.0-i2i | ByteDance | - | - | - |
| qwen3-asr-flash | Alibaba | - | - | - |
| qwen-vl-plus | Alibaba | $0.8 | - | 128 |

Pikachu

Gheya 1

Fara 7B

Granite 4.0 H Small FP8

Trlm 135m GGUF

QuestA Nemotron 1.5B

MobileLLM R1 950M Base

Cthulhu 24B V1.2 GGUF

Cthulhu 24B V1.1 GGUF

SmolLM3 3B

Qwen3 0.6B GGUF

Ovis2 1B Dev

ThinkEdit Deepseek Qwen 14b

Cuckoo C4 Rainbow

Cuckoo C4 Instruct

Cuckoo C4

Selene 1 Mini Llama 3.1 8B

ModernBERT Large Llm Router

TinyCodeLM 150M

Mistral Nemo Instruct 2407