Recently, NVIDIA announced that it has open-sourced its generative AI facial animation model, Audio2Face. The release includes not only the core models but also a software development kit (SDK) and a complete training framework, aiming to accelerate the development of intelligent virtual characters in games and 3D applications.
Audio2Face drives a virtual character's facial movements in real time by analyzing acoustic features in audio, such as phonemes and intonation, generating accurate lip-sync and natural emotional expressions. This technology is applicable across multiple fields, including gaming, film production, and customer service.
The Audio2Face model supports two operating modes: offline rendering of pre-recorded audio, and real-time streaming for dynamic AI characters. To support developers, NVIDIA has also open-sourced several key components, including the Audio2Face SDK, a local execution plugin for Autodesk Maya, and a plugin for Unreal Engine 5.5 and above. In addition, the regression and diffusion models are open-sourced, so developers can use the training framework to fine-tune them on their own data and adapt them to specific application scenarios.
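To make the idea of audio-driven facial animation concrete, here is a deliberately simplified toy sketch: it maps per-frame audio energy to a single "jaw open" blendshape weight. This is not the Audio2Face model or its SDK API; the frame size, gain, and linear mapping are illustrative assumptions only, standing in for the learned mapping the real model performs.

```python
import math

def rms_energy(samples, frame_size):
    """Split a mono sample list into frames and return per-frame RMS energy."""
    frames = [samples[i:i + frame_size] for i in range(0, len(samples), frame_size)]
    return [math.sqrt(sum(s * s for s in f) / len(f)) for f in frames if f]

def jaw_open_weights(samples, frame_size=160, gain=4.0):
    """Map per-frame energy to a 0..1 'jaw open' blendshape weight.

    The gain and the linear energy-to-weight mapping are illustrative
    choices for this sketch, not anything from the actual Audio2Face model,
    which learns the mapping from audio features to full facial motion.
    """
    return [min(1.0, e * gain) for e in rms_energy(samples, frame_size)]

# Synthetic 16 kHz test signal: one second of silence, then a 220 Hz tone.
sr = 16000
silence = [0.0] * sr
tone = [0.5 * math.sin(2 * math.pi * 220 * t / sr) for t in range(sr)]
weights = jaw_open_weights(silence + tone)
# Silent frames yield weight 0.0; voiced frames push the jaw open.
```

A real system replaces this heuristic with a neural network that predicts dozens of blendshape (or vertex) targets per frame, which is what lets Audio2Face produce emotion and coarticulation rather than simple mouth flapping.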
This technology has already been adopted by a number of game developers. Survios integrated Audio2Face into "Aliens: Fireteam Elite," significantly simplifying its lip-sync and facial capture pipeline. Meanwhile, The Farm 51 applied the technology in "Chernobylite 2: Exclusion Zone," generating detailed facial animations directly from audio, saving substantial production time and enhancing the realism and immersion of its characters. The studio's innovation director, Wojciech Pazdur, described the technology as a "revolutionary breakthrough."
NVIDIA's move gives developers more creative tools and should further advance expressive virtual characters. As the technology matures, we can look forward to more realistic and vivid character performances in future games and films.
Entry: https://build.nvidia.com/nvidia/audio2face-3d
Key Points:
🔊 NVIDIA open-sources the Audio2Face model, aiming to improve the facial animation generation technology for virtual characters.
🎮 Supports both offline rendering and real-time streaming, suiting a variety of scenarios.
🌟 Has been adopted by multiple game developers, simplifying the production process and enhancing the realism of characters.