Google Launches Veo 3.1 Video Generation Model: New Audio Features and Fine-Grained Editing Capabilities

AIbase基地

Published in AI News · 4 min read · Oct 16, 2025

Google has launched the video generation model Veo 3.1, an upgrade to Veo 3, which was released in May this year. The new version improves audio output, the granularity of editing control, and image-to-video quality, so that it can generate more realistic video clips and follow user instructions more accurately.

In terms of functionality, Veo 3.1 allows users to add new objects to videos, and the system automatically integrates them into the existing visual style. Google also revealed that it will soon support removing existing objects from videos in its video editing tool Flow, further enhancing editing flexibility.

Veo 3 already offered several editing features: generating characters from reference images, generating the middle of a video from its first and last frames, and extending an existing video from its last frame. The core upgrade in Veo 3.1 is that audio generation now applies to all of these editing functions, so the output clips include sound, which makes the content more complete and immersive.

In terms of deployment, Veo 3.1 will be available to users through multiple platforms. Google is integrating the model into the video editor Flow and the Gemini app, and is offering it to developers through Vertex AI and the Gemini API. According to data disclosed by Google, over 275 million videos have been created on the Flow platform since its launch in May.

This update reflects the evolution of AI video generation technology in two directions. One is the continuous improvement of generation quality: more realistic visuals and more accurate understanding of user prompts. The other is the refinement of editing capabilities, moving from whole-clip generation to local modifications and fine-grained operations such as adding or removing objects. The addition of audio generation addresses a common shortcoming of AI video tools, which previously lacked sound.

In terms of technical maturity, however, AI video generation is still in a phase of rapid iteration. Video coherence, adherence to physical laws, and the ability to handle complex scenes are all still being improved across models. The actual performance of Veo 3.1, including the quality of audio-video synchronization and how naturally added objects blend into a scene, remains to be verified in real-world use.

This article is from AIbase Daily
