Amazon Launches Nova 2 Series Models, AI Performance Reaches New Heights!

AIbase基地

Published inAI News · 3 min read · Dec 3, 2025

At the re:Invent2025 conference, Amazon Web Services (AWS) launched four "Nova2" series self-developed large models, covering multi-modal scenarios such as text, images, videos, and speech. For the first time, they have built-in web search and code execution capabilities, claiming to achieve "industry-leading price-performance ratio" for the same tasks.

Performance Comparison

- Nova2Lite: Positioned as a high-cost-performance inference model, it matches or exceeds Claude Haiku4.5 in 13 out of 15 benchmarks, and matches or exceeds GPT-5Mini in 11 out of 17 benchmarks, with a cost of about 50% of the latter.

- Nova2Pro: Designed for complex Agent tasks, it matches or exceeds Claude Sonnet4.5 in 10 out of 16 evaluations, and matches or exceeds Gemini3Pro Preview in 8 out of 18 evaluations.

- Nova2Sonic: An end-to-end speech model, with real-time latency below 600ms, supporting a context of up to one million tokens and asynchronous background tasks.

- Nova2Omni: The industry's first unified multi-modal model, capable of inputting text/images/videos/audio and outputting text + images, completing understanding and generation with a single model.

Technical Highlights

All models in the series are integrated with "web search + code execution" dual tools, enabling real-time internet information retrieval and Python execution, ensuring answers based on the latest facts rather than just training data. AWS stated that tens of thousands of enterprises have already used the Nova series for content production, multi-step automation, and AI Agent development.

Market Strategy

AWS also launched the "Nova Forge" custom training service, which costs $100,000 annually to inject private data during the pre-training or post-training phase, building a tailored cutting-edge model. The goal is to reduce the cost of enterprises building large models from "hundreds of millions of dollars" to the million-dollar level.

Industry Perspectives

Nova2 AWS LargeModel Multimodal

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Three-Year Delayed Long Article: Former OpenAI Security VP Wang Li Analyzes Scaling Laws: Your Model May Have Been Trained on the Wrong Data

Lilian Weng returns with a deep dive into scaling laws, arguing the industry consensus may be reversed: from Kaplan to Chinchilla, the mainstream data allocation might not be optimal. It examines compute, model size, and data quantity trade-offs, implying the billions-invested path requires reconsideration, prompting a re-evaluation of pretraining recipes.....

Jun 26, 2026

270

Amazon Invests Heavily in India: Plans to Invest 13 Billion USD in AI and Cloud Infrastructure

Amazon announced that it will invest an additional 13 billion USD in India by 2030, focusing on expanding AWS data centers in Mumbai and Hyderabad, and strengthening its AI and cloud service capabilities.

Jun 26, 2026

160

The Eve of AGI: DeepSeek Expands All Departments by Double, Intensifying the Competition for Top Talent in Large Models

DeepSeek announced plans to double team sizes across seven roles including full-stack development and AI core system R&D, highlighting its aggressive push towards AGI and strong capabilities. The high-profile AI innovator continues its global momentum.....

Jun 26, 2026

270

Performance Improved 475 Times! Fujitsu Unveils New PHOTON Architecture, Targeting AI Computing Bottlenecks

Fujitsu launched PHOTON, an innovative architecture using top-down parallel layered computing for networks. It targets the bottleneck in Transformer processing for long texts and high concurrency, caused by frequent memory access for historical context, aiming to break computational cost and efficiency limits.....

Jun 26, 2026

170

SenseTime Enters the Intelligent Agent Arena: New All-Modal Base Model is Ready to Launch

The competition in large models is shifting towards agents. SenseTime is developing the industry's first natively multimodal agent base, integrating a unified core of "understanding, generation, and action", directly benchmarking against GPT-Image 2, and pushing AI from passive Q&A to active execution.....

Jun 25, 2026

400

SenseTime Secretly Developing Multimodal Model U1Pro: Led by Lin Dahua, Expected to Launch Internal Testing in July, Targeting OpenAI

SenseTime is secretly developing the multimodal large model U1Pro, targeting design scenarios, led by Chief Scientist Lin Dahua. The model belongs to the "Ri Ri Xin" family, aiming to compete with OpenAI's GPT-Image2, emphasizing long-range logic and thinking capabilities, and expected to launch internal testing and commercial use in July.

Jun 25, 2026

410

2026 Global Unicorn Total Valuation Surges 43%: Large Models Spark Capital Mania, Reshaping the Focus of the Global Tech Industry

The Hurun Global Unicorn Index 2026 reveals AI reshapes business, with total unicorn value surging 43% YoY to 54 trillion yuan. FinTech leads with 216 unicorns, but AI has 215, up 87 in a year, accounting for 36% of value. The focus has shifted to large models and AI.....

Jun 25, 2026

250

Breaking the Barrier of Multimodal Switching! Google Brings Native Computer Operations into Gemini 3.5 Flash

Google DeepMind integrates native computer use capabilities into Gemini 3.5 Flash. Developers can now use a single model for building autonomous AI agents that operate across browsers, phones, and desktops. This eliminates context switching between models, streamlining long-running cross-platform tasks.....

Jun 25, 2026

280

Enhancing the WENXIN 5.1 Foundation: Baidu WENXIN Website Fully Expands, Introducing New Tools Such as Office Online Editing

Baidu merges its Wenxin-related sites into a new "Baidu Wenxin Website", creating a one-stop AI service super entry to lower user barriers and boost efficiency, backed by the upgraded Wenxin 5.1 large model.....

Jun 25, 2026

170

Google Gemini 3.5 Pro Release Delayed, Refining Core Capabilities Becomes the Top Priority

Google's next flagship Gemini 3.5 Pro, originally slated for release this month, has been postponed to July. The delay is not due to technical stagnation but to allow the R&D team more time for deeper optimization and refinement, aiming for higher product maturity. This reflects the intense competition in computing power and models, with major players more cautiously balancing release timing and quality.....

Jun 25, 2026

300

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Ranking Monitor

AI Conversation Insight

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

LLM API Proxy Checker

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

Amazon Launches Nova 2 Series Models, AI Performance Reaches New Heights!

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Three-Year Delayed Long Article: Former OpenAI Security VP Wang Li Analyzes Scaling Laws: Your Model May Have Been Trained on the Wrong Data

Amazon Invests Heavily in India: Plans to Invest 13 Billion USD in AI and Cloud Infrastructure

The Eve of AGI: DeepSeek Expands All Departments by Double, Intensifying the Competition for Top Talent in Large Models

Performance Improved 475 Times! Fujitsu Unveils New PHOTON Architecture, Targeting AI Computing Bottlenecks

SenseTime Enters the Intelligent Agent Arena: New All-Modal Base Model is Ready to Launch

SenseTime Secretly Developing Multimodal Model U1Pro: Led by Lin Dahua, Expected to Launch Internal Testing in July, Targeting OpenAI

2026 Global Unicorn Total Valuation Surges 43%: Large Models Spark Capital Mania, Reshaping the Focus of the Global Tech Industry

Breaking the Barrier of Multimodal Switching! Google Brings Native Computer Operations into Gemini 3.5 Flash

Enhancing the WENXIN 5.1 Foundation: Baidu WENXIN Website Fully Expands, Introducing New Tools Such as Office Online Editing

Google Gemini 3.5 Pro Release Delayed, Refining Core Capabilities Becomes the Top Priority

AI News Recommendations

Three-Year Delayed Long Article: Former OpenAI Security VP Wang Li Analyzes Scaling Laws: Your Model May Have Been Trained on the Wrong Data

Amazon Invests Heavily in India: Plans to Invest 13 Billion USD in AI and Cloud Infrastructure

The Eve of AGI: DeepSeek Expands All Departments by Double, Intensifying the Competition for Top Talent in Large Models

Performance Improved 475 Times! Fujitsu Unveils New PHOTON Architecture, Targeting AI Computing Bottlenecks

SenseTime Enters the Intelligent Agent Arena: New All-Modal Base Model is Ready to Launch

SenseTime Secretly Developing Multimodal Model U1Pro: Led by Lin Dahua, Expected to Launch Internal Testing in July, Targeting OpenAI

2026 Global Unicorn Total Valuation Surges 43%: Large Models Spark Capital Mania, Reshaping the Focus of the Global Tech Industry

Breaking the Barrier of Multimodal Switching! Google Brings Native Computer Operations into Gemini 3.5 Flash

Enhancing the WENXIN 5.1 Foundation: Baidu WENXIN Website Fully Expands, Introducing New Tools Such as Office Online Editing

Google Gemini 3.5 Pro Release Delayed, Refining Core Capabilities Becomes the Top Priority