AIBase
Home
AI NEWS
AI Tools
GEO & AEO
MCP
AI Models
AI Marketplace
EN

AI News

View More

Bidirectional Audio-Visual Separation: Tongyi Lab Releases PrismAudio to Let AI Understand Videos and Revoice Them

Tongyi Lab of Alibaba has launched the PrismAudio framework, which solves the issue of audio-video desynchronization in AI video generation. The technology introduces a 'chain-of-thought' mechanism, analyzing video content first and then generating matching sound effects to enhance immersion. The research has been accepted by ICLR 2026.

14.5k 22 hours ago
Bidirectional Audio-Visual Separation: Tongyi Lab Releases PrismAudio to Let AI Understand Videos and Revoice Them
AIBase
Empowering the future, your artificial intelligence solution think tank
English简体中文繁體中文にほんご
FirendLinks:
AI Newsletters AI ToolsMCP ServersAI NewsAIBaseLLM LeaderboardAI Ranking
© 2026AIBase
Business CooperationSite Map