Chinese Team Cracks Token Limitation, Unlocking Model Potential Three Times That of Autoregressive Models
A Chinese team found that diffusion language models learn about 3x more than autoregressive ones from the same data under token limits. A 1B-parameter model trained for 480 epochs on 1B tokens excels on the HellaSwag and MMLU benchmarks, pointing to a potential breakthrough for data-constrained language model training.