Xiaohongshu's ZhiChuang Audio Technology Team recently launched FireRedTTS-2, its next-generation dialogue synthesis model and another significant step forward in dialogue generation technology. The model targets common pain points in existing dialogue synthesis systems, such as limited flexibility, frequent pronunciation errors, unstable speaker switching, and unnatural prosody.


FireRedTTS-2 upgrades its core modules, in particular the discrete speech encoder and the text-to-speech model, to improve synthesis quality across the board. In multiple objective and subjective evaluations, FireRedTTS-2 delivers industry-leading performance, offering a stronger solution for multi-speaker dialogue synthesis. The technical report has been published on arXiv, and the model can be tried through the official demo and the code repository linked below.

A standout feature of FireRedTTS-2 is the naturalness of its output. The model accurately captures details such as stress, emotion, and pauses, producing natural, fluent audio. Compared with closed-source dialogue generation models, FireRedTTS-2 not only generates high-quality podcast audio but also supports voice cloning: given just one sentence of sample speech from each speaker, it imitates their voice and speaking habits to generate an entire dialogue automatically (see the sketch below). This capability makes it highly competitive among open-source dialogue generation models.
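To make the one-shot cloning workflow concrete, here is a minimal usage sketch. The package path, the `FireRedTTS2` class, and the `generate_dialogue` method are illustrative assumptions for this article, not the repository's confirmed interface; consult the GitHub code link for the actual API.

```python
# Illustrative sketch only: the package, class, and method names below are
# assumptions for demonstration, not the repository's confirmed API.
import torchaudio

from fireredtts2 import FireRedTTS2  # hypothetical import path

# Load a pretrained dialogue model (checkpoint directory is a placeholder).
model = FireRedTTS2(pretrained_dir="./pretrained_models/FireRedTTS2")

# One short reference clip per speaker is enough for voice cloning.
prompt_wavs = {
    "S1": "speaker1_prompt.wav",  # a single sentence from speaker 1
    "S2": "speaker2_prompt.wav",  # a single sentence from speaker 2
}

# Dialogue script with speaker tags; the model imitates each cloned voice.
script = [
    ("S1", "Welcome back to the show. Today we're talking about speech synthesis."),
    ("S2", "Thanks for having me. Let's start with why prosody is so hard to get right."),
]

# Generate the full multi-speaker dialogue as a single waveform and save it.
audio, sample_rate = model.generate_dialogue(script=script, prompts=prompt_wavs)
torchaudio.save("podcast_demo.wav", audio, sample_rate)
```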

FireRedTTS-2 is trained on multiple languages, including Chinese, English, Japanese, Korean, and French. It uses a low-frame-rate discrete speech encoder to improve synthesis speed and stability, and its dual-Transformer architecture makes the synthesized speech more natural and coherent (a conceptual sketch follows). In addition, FireRedTTS-2 needs only a small amount of data for voice customization, so it adapts quickly to different application scenarios.
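As a rough mental model of the dual-Transformer idea (not the paper's exact architecture), a backbone transformer models the low-frame-rate token sequence over time, while a small per-frame transformer expands each frame into multiple codebook layers. The PyTorch module below is a minimal sketch under those assumptions; all dimensions, layer counts, and names are illustrative.

```python
# Conceptual sketch of a dual-transformer speech-token decoder: a backbone
# transformer runs across frames, and a small per-frame transformer expands
# each frame state into several codebook layers. Illustrative only; a real
# autoregressive model would also need causal masking and sampling logic.
import torch
import torch.nn as nn


class DualTransformerSketch(nn.Module):
    def __init__(self, vocab=8192, n_codebooks=8, d_model=512):
        super().__init__()
        self.token_emb = nn.Embedding(vocab, d_model)
        # Backbone: models the (low-frame-rate) sequence of frames.
        self.backbone = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True),
            num_layers=6,
        )
        # Per-frame decoder: expands one backbone state into all codebooks.
        self.frame_decoder = nn.TransformerEncoder(
            nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True),
            num_layers=2,
        )
        self.codebook_query = nn.Parameter(torch.randn(n_codebooks, d_model))
        self.head = nn.Linear(d_model, vocab)

    def forward(self, token_ids):
        # token_ids: (batch, frames) of interleaved text/speech tokens.
        h = self.backbone(self.token_emb(token_ids))  # (B, T, D)
        B, T, D = h.shape
        # Condition one set of codebook queries on each frame state.
        q = self.codebook_query.unsqueeze(0).expand(B * T, -1, -1)
        frame_h = h.reshape(B * T, 1, D)
        out = self.frame_decoder(torch.cat([frame_h, q], dim=1))
        logits = self.head(out)[:, 1:]  # drop the frame-state slot
        # (B, T, n_codebooks, vocab): per-codebook token logits per frame.
        return logits.reshape(B, T, -1, logits.size(-1))
```

The design intuition is that the expensive backbone only has to run once per low-frame-rate frame, while the cheap per-frame decoder handles the remaining codebook detail, which is one way a low token rate can translate into faster, more stable synthesis.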

The release of FireRedTTS-2 provides an industrial-grade solution for AI podcasts and other dialogue synthesis applications, and it opens up new possibilities for innovation both inside and outside the industry. Going forward, the team plans to keep optimizing the model, expand the number of supported speakers and languages, and explore controllable sound-effect insertion to meet growing market demand.

  • Code link: https://github.com/FireRedTeam/FireRedTTS2 

Key Points:

🎤 FireRedTTS-2 is the next-generation dialogue synthesis model from Xiaohongshu's ZhiChuang Audio Technology Team, aiming to improve synthesis quality and naturalness.  

🗣️ The model supports voice cloning, generating natural multi-speaker dialogues from only a short speech sample per speaker.  

🌐 Supports multiple languages and uses a low-frame-rate discrete speech encoder, improving synthesis speed and stability across a range of application scenarios.