AIBase
Home
AI NEWS
AI Tools
AI Models
MCP
AI Services
AI Compute
AI Tutorial
EN

AI News

View More

Bidirectional Audio-Visual Separation: Tongyi Lab Releases PrismAudio to Let AI Understand Videos and Revoice Them

Tongyi Lab of Alibaba has launched the PrismAudio framework, which solves the issue of audio-video desynchronization in AI video generation. The technology introduces a 'chain-of-thought' mechanism, analyzing video content first and then generating matching sound effects to enhance immersion. The research has been accepted by ICLR 2026.

8.7k 9 minutes ago
Bidirectional Audio-Visual Separation: Tongyi Lab Releases PrismAudio to Let AI Understand Videos and Revoice Them
AIBase
Empowering the future, your artificial intelligence solution think tank
English简体中文繁體中文にほんご
FirendLinks:
AI Newsletters AI ToolsMCP ServersAI NewsAIBaseLLM LeaderboardAI Ranking
© 2026AIBase
Business CooperationSite Map