Zhejiang University and Alibaba have jointly released OmniAvatar, a new audio-driven model that marks a step forward in digital human technology. The model takes audio as its driving signal and generates natural, fluid full-body digital human videos. It performs especially well in singing scenarios, where lip movements stay accurately synchronized with the audio, producing a realistic effect.

OmniAvatar supports fine-grained control over generated details through text prompts: users can customize the character's range of motion, the background environment, and the emotional expression, which gives the model considerable flexibility. It can also generate videos of virtual characters interacting with objects, opening up commercial applications in areas such as e-commerce and marketing. A brand could, for example, use OmniAvatar to produce dynamic advertisements that improve consumer engagement.
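
To make the prompt-based control concrete, here is a minimal Python sketch of how such a request might be assembled. It is illustrative only: `AvatarRequest`, `build_prompt`, and the `pipeline.generate` call mentioned in the comments are hypothetical names invented for this example, not the project's documented interface.

```python
# Hypothetical sketch of prompt-controlled, audio-driven generation.
# AvatarRequest, build_prompt(), and the generate() call referenced in
# the comments are illustrative placeholders, not OmniAvatar's actual API.

from dataclasses import dataclass


@dataclass
class AvatarRequest:
    """Bundles the inputs described in the article: driving audio plus
    a text prompt that constrains motion, background, and emotion."""
    audio_path: str        # driving audio (e.g. speech or singing)
    prompt: str            # free-text control over the generated video
    reference_image: str   # identity/appearance of the digital human


def build_prompt(action: str, background: str, emotion: str) -> str:
    """Compose a control prompt covering the three aspects the article
    says users can customize."""
    return f"A person {action}, with a {emotion} expression, in a {background}."


if __name__ == "__main__":
    request = AvatarRequest(
        audio_path="song.wav",
        prompt=build_prompt(
            action="singing and swaying gently",
            background="concert stage with soft lighting",
            emotion="joyful",
        ),
        reference_image="avatar.png",
    )
    print(request.prompt)
    # A real pipeline would now consume `request` and render the video,
    # e.g. pipeline.generate(request) -> output.mp4 (hypothetical call).
```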

Released as an open-source project on GitHub, OmniAvatar has drawn attention from developers worldwide. Its performance in facial-expression, upper-body, and full-body animation generation reportedly surpasses that of existing comparable models. The model also supports multi-scene applications, including podcast programs, interpersonal interaction, and dynamic performances, pointing to significant potential in content creation.

Industry experts say that the release of OmniAvatar not only improves the realism and controllability of audio-driven digital human technology but also encourages innovative applications of AI in fields such as marketing, education, and entertainment. Going forward, Zhejiang University and Alibaba plan to deepen their cooperation and explore further possibilities in multimodal AI.