Qwen APP Integrates Wan2.5 Video Capabilities for a New Upgrade

AIbase基地

Published inAI News · 4 min read · Dec 2, 2025

On December 2nd, the Qwen APP integrated the latest model of the Wanxiang series, Wan 2.5, further enhancing its video creation capabilities. The model offers significant improvements in action accuracy and body coordination, and it is the first mobile AI assistant to support simultaneous audio and video output.

The Ali Wanxiang 2.5 is one of the few video models in the industry that supports synchronized audio and video. This model supports multiple tasks such as understanding and generating, and it accepts and outputs various modalities including text, images, videos, and audio. On the authoritative large model evaluation platform LMArena, Wanxiang's image-to-video capability ranks third globally and first domestically.

In the Qwen APP, users only need a photo and a piece of text, without any templates, to generate a high-definition 1080P dance video with natural body movements and accurate lip-sync. The maximum length supported is 10 seconds. Tests show that Qwen APP supports a wide range of subjects, including real person photos, cute pets, anime characters, cultural relics, and cartoon figures.

Last year, when Alibaba launched the photo-dancing feature, it quickly became popular both domestically and internationally, inspiring netizens' creative enthusiasm. Videos of Terracotta Warriors, cute kids, and pets dancing spread across the internet. With the integration of Wanxiang 2.5, the Qwen APP not only significantly improves video creation quality but also further lowers the barrier to video creation, supporting users to upload their own photos and input text. For example, users just need to input an image and a text like "a cat sings and dances," and the Qwen APP can accurately generate a video, bringing static images to life instantly.

This new feature has once again sparked netizens' creative enthusiasm, leading to a surge of more creative "photo dance" content on social platforms. For instance, users can first use the Qwen APP to merge two images into a photo in the style of a medieval painting, then input text such as "the people in the image sing and dance, with a dynamic camera shot," to achieve a video effect of group singing and dancing, while maintaining high-quality dynamic performance and strong subject consistency.

According to reports, during its public beta test, the Qwen App exceeded 10 million downloads within a week, surpassing ChatGPT, Sora, and DeepSeek to become the fastest-growing AI application in history.

Qwen APP Wan2.5 Audio-Video Synchronization AI Video Generation

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Google Search AI Mode Fully Launched: One-Click Conversation Within Results Page, Bypassing Steps Becomes a Thing of the Past

Google's mobile search AI mode is now live globally, enabling users to interact directly with Gemini on results pages for instant follow-ups and multi-turn conversations, streamlining traditional search into 'one scroll, one question.' It uses 'query fan-out' to break queries into subtopics, fetching data from knowledge graphs, real-time sports, finance, and shopping in parallel.....

Dec 2, 2025

NetEase Youdao Dictionary Reveals 2025 Annual Word - DeepSeek Leads with 8.67 Million Searches

The 2025 annual hot words of NetEase Youdao Dictionary have been revealed, with "DeepSeek" topping the list with 8.67 million searches, becoming the first annual word originating from a domestic AI large model. The search popularity rapidly increased after the release of the DeepSeek-R1 model in February, and subsequent technological breakthroughs also drove peak query periods. College students and working professionals are the main search groups, and users often extend their browsing to related concepts such as "large models" after looking up words, forming a chain of "looking up words - learning concepts," reflecting the trend of AI technology popularization driving public awareness deepening.

Dec 2, 2025

Google AI Search Experience Speeds Up: New Design Enables Seamless Conversation Gemini3Pro Enters 120 Markets!

Google is democratizing AI by enhancing design for seamless switching from AI overview to chat, and expanding Gemini 3 Pro globally. New mobile features enable one-tap access to AI conversations, improving user experience.....

Dec 2, 2025

Apple AI Leadership Change: Subramanya, Former Gemini Executive, Takes Charge as Giannandrea Departs in Spring

Apple's Chief AI Officer John Giannandrea steps down, succeeded by former Google Gemini head Amar Subramanya. This move is seen as a response to issues with the 'Apple Intelligence' project, criticized for generating misleading headlines in its summary feature.....

Dec 2, 2025

100

Nvidia Releases New AI Model Alpamayo-R1, Advancing Autonomous Driving Research

NVIDIA unveils new AI infrastructure and models at NeurIPS, advancing physical AI for robotics and autonomous vehicles. Highlights include Alpamayo-R1, the first open reasoning vision-language model designed for autonomous driving, enhancing environmental perception by processing text and images.....

Dec 2, 2025

100

NVIDIA Abandons Physics AI and Reverts: Open-Source Autonomous Driving Inference Model Alpamayo-R1 Allows Vehicles to Think Before Accelerating

NVIDIA unveils Alpamayo-R1, an L4 autonomous driving inference model at NeurIPS 2025. Built on the Cosmos-Reason series, it processes camera, LiDAR, and text instructions simultaneously to output driving decisions via internal reasoning. The model employs an end-to-end unified architecture across vision, language, and action modalities, minimizing error accumulation and aiming to imbue vehicles with 'human-like common sense'.....

Dec 2, 2025

AiShi Technology Launches PixVerse V5.5: The First Director-Level Multi-Camera Narrative Video Large Model in China

AiShi Technology launched PixVerse V5.5 (the domestic version "PaiWo AI V5.5"), achieving a full upgrade and opening for trial. This model is the first AI video large model in China to support "multi-camera + audio-visual synchronization one-click output", promoting AI-generated videos from "single-camera materials" into the "complete narrative short film" stage. Based on its self-developed MVL architecture, V5.5 can automatically complete script breakdown, shot scheduling, and sound effect generation within 5-10 seconds, significantly improving the completeness and efficiency of video production.

Dec 2, 2025

Building AI Applications with Natural Language Becomes a Trend, 3.3 Million Spark Applications Emerge

The 'Lingguang' AI assistant app generated 3.3 million 'flash apps' in two weeks, showcasing its potential to address fragmented daily needs through practical tools and entertainment features.....

Dec 2, 2025

Runway Launches New Gen-4.5 Video Generation Model to Enhance Creativity and Visual Quality

Runway's Gen-4.5 video generation model enhances visual accuracy and creative control, enabling users to create high-definition dynamic videos from brief text prompts, supporting complex scenes and vivid characters, trained and inferred on Nvidia GPUs for optimized precision and style.....

Dec 2, 2025

MIT-based Startup Liquid AI Unveils Enterprise-Level Small Model Training Blueprint LFM2

Liquid AI company released the second generation of Liquid Foundation Models (LFM2) in July 2025, featuring an innovative "liquid" architecture, aiming to become the fastest on-device foundation model in the market. Its efficient training and inference capabilities allow small models to rival large language models in the cloud. LFM2 initially offers dense checkpoint versions with 350M, 700M, and 1.2B parameters.

Dec 2, 2025

120

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

AI Models Finder

LLM Leaderboard

Model Providers

Submit Your Model

Compare LLMs

LLM Cost Calculator

LLM Arena

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

AI Brand Monitoring Tool

GEO Services​

AI Search Visibility Checker

AI Model Compatibility Checker

AI Deployment Calculator

AI Dataset Collection

Intelligent Document Recognition

Qwen APP Integrates Wan2.5 Video Capabilities for a New Upgrade

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Google Search AI Mode Fully Launched: One-Click Conversation Within Results Page, Bypassing Steps Becomes a Thing of the Past

NetEase Youdao Dictionary Reveals 2025 Annual Word - DeepSeek Leads with 8.67 Million Searches

Google AI Search Experience Speeds Up: New Design Enables Seamless Conversation Gemini3Pro Enters 120 Markets!

Apple AI Leadership Change: Subramanya, Former Gemini Executive, Takes Charge as Giannandrea Departs in Spring

Nvidia Releases New AI Model Alpamayo-R1, Advancing Autonomous Driving Research

NVIDIA Abandons Physics AI and Reverts: Open-Source Autonomous Driving Inference Model Alpamayo-R1 Allows Vehicles to Think Before Accelerating

AiShi Technology Launches PixVerse V5.5: The First Director-Level Multi-Camera Narrative Video Large Model in China

Building AI Applications with Natural Language Becomes a Trend, 3.3 Million Spark Applications Emerge

Runway Launches New Gen-4.5 Video Generation Model to Enhance Creativity and Visual Quality

MIT-based Startup Liquid AI Unveils Enterprise-Level Small Model Training Blueprint LFM2

GEO Services