Aliyun Open Sources ThinkSound: AI Automatically Adds Sound Effects to Videos, Bringing a Major Transformation to Film and Game Creation!
Alibaba open sources the audio generation model ThinkSound, which supports multimodal inputs such as video, text, and audio, and can automatically generate high-fidelity sound effects that highly match the visuals. The model uses chain reasoning technology to achieve precise synchronization between audio and video, and is applicable to fields such as film and games. As an open source project, ThinkSound lowers the barriers to sound effect creation, and developers can freely access it through multiple platforms. This is Alibaba's latest breakthrough in the field of multimodal AI, and will drive the development of sound generation technology.