Best Learning Models AI Tools & Models - Premium Learning Models News

AI News

Alibaba TONGYI Qwen Open Source Qwen3.5 Small Model Series: Multimodal Agent Can Run on Edge Devices

The Alibaba TONGYI Qwen team has launched the Qwen3.5 small model series, including four lightweight models of 0.8B, 2B, 4B, and 9B, along with their corresponding base versions. They are based on a unified architecture, equipped with native multimodal capabilities (supporting image-text processing), with structural improvements and reinforcement learning training that can be scaled, achieving higher intelligence levels with fewer computing resources. Among them, the 0.8B and 2B models are extremely compact and fast in inference, specifically optimized for edge devices.

13.4k 4 minutes ago

Apple Papers Shock Again! Qwen3-Coder Surpasses GPT-5 After Special Tuning?

Apple team surpassed leading large models in UI design by improving open-source models. Traditional AI-generated code performs poorly in UI design because reinforcement learning with human feedback is too crude. Apple achieved a breakthrough with fine-tuning, enabling a small model to excel in specific tasks and solving the long-standing issue of interface development for developers.

53.3k 4 hours ago

DeepMind Veteran David Silver Leaves to Start His Own Venture: Betting on Reinforcement Learning to Challenge the Limitations of Large Models

Core figure from DeepMind, David Silver, leaves to start his own company, Ineffable Intelligence. He argues that AI should not rely solely on human data to train large models, but should explore more autonomous paths for intelligence. His departure marks a shift of top AI talent toward more experimental new directions.

10.6k 4 hours ago

New Benchmark for Domestic Reasoning Models! Alibaba Releases Qwen3-Max-Thinking with Trillion-Parameter Performance Targeting GPT-5.2

Alibaba's Qwen3-Max-Thinking, a trillion-parameter model, excels in complex reasoning, factual knowledge, and agent capabilities. Trained with large-scale reinforcement learning and adaptive tool calling, it matches top models like GPT-5.2-Thinking.....

12.6k yesterday

New Benchmark for Domestic Reasoning Models! Alibaba Releases Qwen3-Max-Thinking with Trillion-Parameter Performance Targeting GPT-5.2

AI Products

Search-R1

A highly efficient reinforcement learning framework for training language models that perform reasoning and call search engines.

Model training and deployment

11.2k

d1

Improving the reasoning capabilities of diffusion large language models using reinforcement learning.

Writing assistant

9.7k

Factorio Learning Environment

A testing and learning environment for large language models based on the game Factorio

Model training and deployment

11k

SWE-RL

Enhancing the reasoning capabilities of large language models in open-source software evolution through reinforcement learning.

Code assistant

8.9k

Models

Gemini 2.0 Flash-Lite

Google

$0.49

Input tokens/M

$2.1

Output tokens/M

Context Length

GPT-4.1 mini

Openai

$2.8

Input tokens/M

$11.2

Output tokens/M

Context Length

Grok 4 Fast

Xai

$1.4

Input tokens/M

$3.5

Output tokens/M

Context Length

o3-mini

Openai

$7.7

Input tokens/M

$30.8

Output tokens/M

200

Context Length

GPT-5 Codex

Openai

Input tokens/M

Output tokens/M

Context Length

Claude 3 Opus

Anthropic

$105

Input tokens/M

$525

Output tokens/M

200

Context Length

Gemini 2.0 Flash

Google

$0.7

Input tokens/M

$2.8

Output tokens/M

Context Length

Claude Haiku 4.5

Anthropic

Input tokens/M

$35

Output tokens/M

200

Context Length

Gemini 2.5 Flash

Google

$2.1

Input tokens/M

$17.5

Output tokens/M

Context Length

Claude Sonnet 4.5

Anthropic

$21

Input tokens/M

$105

Output tokens/M

200

Context Length

Claude 3 Sonnet

Anthropic

$21

Input tokens/M

$105

Output tokens/M

200

Context Length

Gemini 2.5 Flash-Lite

Google

$0.7

Input tokens/M

$2.8

Output tokens/M

Context Length

qwen3-vl-235b-a22b-thinking

Alibaba

Input tokens/M

$20

Output tokens/M

Context Length

qwen3-coder-plus

Alibaba

Input tokens/M

$16

Output tokens/M

Context Length

wan2.5-i2i-preview

Alibaba

Input tokens/M

Output tokens/M

Context Length

Qianfan-Lightning

Baidu

Input tokens/M

Output tokens/M

128

Context Length

qwen3-max

Alibaba

Input tokens/M

$24

Output tokens/M

256

Context Length

qwen3-vl-plus

Alibaba

Input tokens/M

$10

Output tokens/M

256

Context Length

qwen-image-plus

Alibaba

Input tokens/M

Output tokens/M

Context Length

qwen-image-edit

Alibaba

Input tokens/M

Output tokens/M

Context Length

MCP

Wrike Mcp Server

The Wrike MCP server is a lightweight implementation used to connect the Wrike project management platform with language learning models (LLMs), providing API interfaces for task querying, comment adding, task creation, etc.

python

10.5k

2.5points

MlflowMCPServer

This project provides a natural language interaction interface for MLflow through the Model Context Protocol (MCP), allowing users to query and manage machine learning experiments and models in English. It includes server - side and client - side components.

python

9.6k

2.5points

MLflow

This project provides a Model Context Protocol (MCP) service for MLflow through a natural language interface, simplifying the management and query of machine learning experiments and models.

python

6.3k

2.5points

Rag Mcp Pipeline Research

An open - source project researching the integration of Retrieval Augmented Generation (RAG) with Multi - Cloud Processing (MCP) servers, focusing on the application of free models in business software and providing a modular learning path and practical cases.

python

8.5k

2.0points

Empowering the future, your artificial intelligence solution think tank

English 简体中文繁體中文にほんご

FirendLinks:

AI Newsletters AI Tools MCP Servers AI News AIBase LLM Leaderboard AI Ranking

Business Cooperation Site Map

AI News

Alibaba TONGYI Qwen Open Source Qwen3.5 Small Model Series: Multimodal Agent Can Run on Edge Devices

Apple Papers Shock Again! Qwen3-Coder Surpasses GPT-5 After Special Tuning?

DeepMind Veteran David Silver Leaves to Start His Own Venture: Betting on Reinforcement Learning to Challenge the Limitations of Large Models

New Benchmark for Domestic Reasoning Models! Alibaba Releases Qwen3-Max-Thinking with Trillion-Parameter Performance Targeting GPT-5.2

AI Products

Search-R1

d1

Factorio Learning Environment

SWE-RL

Models

Gemini 2.0 Flash-Lite

GPT-4.1 mini

Grok 4 Fast

o3-mini

GPT-5 Codex

Claude 3 Opus

Gemini 2.0 Flash

Claude Haiku 4.5

Gemini 2.5 Flash

Claude Sonnet 4.5

Claude 3 Sonnet

Gemini 2.5 Flash-Lite

qwen3-vl-235b-a22b-thinking

qwen3-coder-plus

wan2.5-i2i-preview

Qianfan-Lightning

qwen3-max

qwen3-vl-plus

qwen-image-plus

qwen-image-edit

Actio Ui 7b Rlvr GGUF

Nanbeige4 3B Thinking 2511

OpenMMReasoner RL

Olmo 3 7B Instruct DPO

Wavjepa Base

Pokee_research_7b GGUF

Gelato 30B A3B

G2RPO

Apriel 1.5 15b Thinker GGUF

QuestA Nemotron 1.5B

MiMo Audio 7B Instruct

MiMo Audio 7B Base

DeepSeek GRM 16B

Dinov3 Vits16plus Pretrain Lvd1689m

Dinov3 Vits16 Pretrain Lvd1689m

Dinov3 Vit7b16 Pretrain Lvd1689m

GLM 4.1V 9B Thinking GGUF

GLM 4.1V 9B Thinking

Deep Ignorance Unfiltered

AReaL Boba 2 8B

MCP

Wrike Mcp Server

MlflowMCPServer

MLflow

Rag Mcp Pipeline Research