Fudan University's MOSS team has launched MOSS-Speech, the first end-to-end speech-to-speech dialogue system. A demo is now available on Hugging Face, and the weights and code have been open-sourced. MOSS-Speech adopts a "layer-splitting" architecture: the parameters of the original MOSS text LLM are frozen, and three new modules are added for speech understanding, semantic alignment, and neural vocoding. This allows the model to handle spoken question answering, emotion imitation, and laughter generation in a single pass, without the three-stage ASR→LLM→TTS pipeline.
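
The described split (a frozen text backbone plus new trainable speech layers) can be pictured with a minimal PyTorch sketch. Everything below is an illustrative assumption, not the actual MOSS-Speech code: the module names, dimensions, and the choice of a GRU speech encoder are placeholders, but the sketch shows how the frozen LLM and the three added components would fit together.

```python
# Minimal sketch of the layer-splitting idea: a frozen text LLM backbone with new
# trainable speech modules attached. Names and shapes are illustrative assumptions.
import torch
import torch.nn as nn
from transformers import AutoModelForCausalLM

class SpeechToSpeechWrapper(nn.Module):
    def __init__(self, base_model_name: str, speech_dim: int = 1024, codec_vocab: int = 4096):
        super().__init__()
        # Frozen text LLM backbone: its parameters are never updated.
        self.llm = AutoModelForCausalLM.from_pretrained(base_model_name)
        for p in self.llm.parameters():
            p.requires_grad = False

        hidden = self.llm.config.hidden_size
        # Three new trainable components, mirroring the description above.
        self.speech_encoder = nn.GRU(speech_dim, hidden, batch_first=True)  # speech understanding
        self.align_proj = nn.Linear(hidden, hidden)                          # semantic alignment
        self.vocoder_head = nn.Linear(hidden, codec_vocab)                   # predicts acoustic codec tokens

    def forward(self, speech_features: torch.Tensor) -> torch.Tensor:
        enc, _ = self.speech_encoder(speech_features)        # (B, T, hidden)
        aligned = self.align_proj(enc)                        # map into the LLM embedding space
        out = self.llm(inputs_embeds=aligned, output_hidden_states=True)
        last_hidden = out.hidden_states[-1]                   # (B, T, hidden)
        return self.vocoder_head(last_hidden)                 # codec-token logits, decoded to audio by a vocoder
```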

Evaluation results show that MOSS-Speech reduces WER to 4.1% on the ZeroSpeech2025 textless speech task and reaches 91.2% emotion recognition accuracy, exceeding both Meta's SpeechGPT and Google's AudioLM; its subjective MOS score on Chinese spoken-language tests reached 4.6, close to the 4.8 of human recordings. The project provides a 48 kHz high-sample-rate version and a 16 kHz lightweight version; the latter runs real-time inference on a single RTX 4090 with latency under 300 ms and is suitable for mobile deployment.
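
For readers who want to sanity-check the sub-300 ms claim on their own hardware, a simple GPU latency harness like the one below is enough. The model-loading line is commented out and purely hypothetical; the real inference API and input format should be taken from the released MOSS-Speech repo.

```python
# Rough end-to-end latency measurement for one inference call on a CUDA GPU.
import time
import torch

def measure_latency(model, speech_features: torch.Tensor, warmup: int = 3, runs: int = 10) -> float:
    """Return mean latency in milliseconds over `runs` forward passes."""
    model.eval()
    with torch.no_grad():
        for _ in range(warmup):          # warm up CUDA kernels and caches
            model(speech_features)
        torch.cuda.synchronize()
        start = time.perf_counter()
        for _ in range(runs):
            model(speech_features)
        torch.cuda.synchronize()         # wait for all GPU work to finish before timing
    return (time.perf_counter() - start) / runs * 1000.0

# Hypothetical usage (names are assumptions): ~1 second of 16 kHz features on one GPU.
# model = SpeechToSpeechWrapper("moss-base").cuda().half()
# feats = torch.randn(1, 100, 1024, device="cuda", dtype=torch.half)
# print(f"mean latency: {measure_latency(model, feats):.1f} ms")
```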

The team revealed that it will soon open-source a speech-control variant, MOSS-Speech-Ctrl, which supports dynamic adjustment of speaking rate, voice timbre, and emotional intensity via voice commands, with release expected in Q1 2026. MOSS-Speech is already available under a commercial license, and developers can obtain the training and fine-tuning scripts from GitHub to perform private voice cloning and character voice conversion locally.
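
The official fine-tuning scripts come from the GitHub repo, so the sketch below is only a hypothetical illustration of how local voice cloning could be structured on top of the wrapper from the first example: train only the newly added speech layers on pairs of speech features and target codec tokens from the target speaker, leaving the frozen LLM untouched. The dataset format, loss, and hyperparameters are assumptions, not the released scripts.

```python
# Hypothetical local fine-tuning loop for voice cloning: only parameters with
# requires_grad=True (the added speech modules) are optimized.
import torch
import torch.nn.functional as F

def finetune_voice(model, dataloader, epochs: int = 3, lr: float = 1e-4, device: str = "cuda"):
    model.to(device).train()
    trainable = [p for p in model.parameters() if p.requires_grad]
    optimizer = torch.optim.AdamW(trainable, lr=lr)

    for epoch in range(epochs):
        total = 0.0
        for speech_features, target_codec_tokens in dataloader:
            speech_features = speech_features.to(device)          # (B, T, speech_dim)
            target_codec_tokens = target_codec_tokens.to(device)  # (B, T) codec token ids
            logits = model(speech_features)                        # (B, T, codec_vocab)
            loss = F.cross_entropy(logits.transpose(1, 2), target_codec_tokens)
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
            total += loss.item()
        print(f"epoch {epoch}: mean loss {total / len(dataloader):.4f}")
```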