Microsoft Open-Sources VibeVoice TTS Model: 90-Minute Long-Form Speech, Four-Speaker Dialogue, and Strong Chinese Performance

AIbase基地

Published in AI News · 5 min read · Aug 26, 2025

Microsoft recently released VibeVoice, an open-source text-to-speech (TTS) model that has drawn significant attention in the AI voice technology field. With its feature set and performance, the model sets a new bar for long-form speech generation, multi-speaker dialogue, and Chinese speech synthesis. Below, AIbase looks at the highlights and potential of VibeVoice.

Supports 90-minute ultra-long speech generation, breaking time limits

VibeVoice can generate up to 90 minutes of continuous speech in a single pass. This suits scenarios that need long audio output, such as podcasts, audiobooks, and educational content. Compared with the duration limits of traditional TTS models, this long-form capability gives content creators more flexibility and creative room.

New heights in multi-speaker dialogue, supporting up to four voices

Unlike earlier TTS models, which were limited to one or two speakers, VibeVoice can generate fluent conversations among up to four speakers. This is useful for simulating multi-person podcasts, meeting recordings, or virtual character interactions. Because the model is optimized for voice consistency and natural turn-taking, the resulting dialogue sounds smooth and comes close to real human recordings.
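
As a rough illustration of the four-speaker limit, the sketch below prepares a dialogue script and rejects any input with more than four distinct speakers. It is not VibeVoice's API: the function name build_script, the constant MAX_SPEAKERS, and the "Speaker N:" line layout are all assumptions made for this example, and the limit of four is taken from the article rather than read from the model.

```python
from typing import Dict, List, Tuple

# Speaker limit as stated in the article; an assumption, not queried from the model.
MAX_SPEAKERS = 4


def build_script(turns: List[Tuple[str, str]]) -> str:
    """Render (speaker_name, text) turns as 'Speaker N: text' lines.

    The 'Speaker N:' layout is a hypothetical convention for illustration.
    Speakers are numbered from 1 in order of first appearance.
    """
    speaker_ids: Dict[str, int] = {}
    lines: List[str] = []
    for name, text in turns:
        if name not in speaker_ids:
            # Refuse a new speaker once the stated limit is reached.
            if len(speaker_ids) >= MAX_SPEAKERS:
                raise ValueError(f"more than {MAX_SPEAKERS} speakers: {name!r}")
            speaker_ids[name] = len(speaker_ids) + 1
        lines.append(f"Speaker {speaker_ids[name]}: {text.strip()}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(build_script([
        ("Alice", "Welcome to the show."),
        ("Bob", "Glad to be here."),
        ("Alice", "Let's begin."),
    ]))
```

Checking the speaker count before synthesis means an over-long cast is caught when the script is written, not partway through a long generation run.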

Excellent Chinese speech effects, promoting localized applications

VibeVoice also performs well for the Chinese market. It supports Chinese speech synthesis and achieves a high level of tonal accuracy, pronunciation accuracy, and naturalness. This gives it broad application potential in Chinese podcasts, education and training, and intelligent customer service, and offers developers a high-quality localized voice solution.

Supports background music, creating an immersive podcast experience

Another highlight of VibeVoice is its ability to generate podcasts with background music. Content creators can add background audio to their speech and produce more immersive, professional-sounding content. Whether the backdrop is a relaxing melody or a tense atmospheric effect, VibeVoice can blend it in, giving listeners a richer auditory experience.

Open source empowers developers, with broad future application prospects

As an open-source model, VibeVoice was officially released on GitHub on August 26, 2025, and developers are free to download it and build on it. By open-sourcing the model, Microsoft lowers the barrier to using high-quality TTS technology and brings new energy to the global AI developer community. Both individual creators and enterprise users can use VibeVoice to quickly build new voice applications.

Address: https://huggingface.co/microsoft/VibeVoice-1.5B

This article is from AIbase Daily
