Microsoft Open Sources VibeVoice-1.5B Model: New Breakthrough in 90-Minute Ultra-Long Speech Synthesis
Microsoft open-sourced VibeVoice-1.5B, a breakthrough audio model for speech synthesis. It can generate 90-minute ultra-long speech in one go, surpassing the previous 60-minute limit, while effectively addressing timbre drift and semantic discontinuity. It supports up to 4 speakers, delivering more natural and superior speech.....