Microsoft officially launches GPT-realtime model, focusing on more realistic voice and multimodal input

AIbase基地

Published inAI News · 3 min read · Sep 5, 2025

Microsoft has officially announced that its latest speech-to-speech (S2S) model, GPT-realtime, has been officially released on the Azure AI Foundry platform. This new model integrates Microsoft's multiple improvements in speech technology into a unified product, with core advantages focusing on natural language processing, excellent audio quality, and more accurate command following capabilities.

Microsoft

Developers can now access GPT-realtime through a new Real-time API. The model is designed to provide more natural and expressive speech output and a higher quality audio experience. As part of this release, Microsoft also introduced two new voice options—Marin and Cedar—intended to offer realistic and clear speech synthesis for users.

In the announcement, Microsoft highlighted several key improvements in the new model, including enhanced function calling capabilities, higher accuracy in command execution, and innovative image input support. This new feature allows users to add images to voice conversations and discuss them, enabling multimodal interaction without relying on video streams.

In addition to technical upgrades, Microsoft also adjusted its pricing model. Compared to the previous gpt-4o-realtime preview version, the official version of gpt-realtime has reduced its price by 20%, with costs calculated based on the usage of per million tokens (token).

This release marks Microsoft's commitment to expanding its real-time AI capabilities for developers and enterprises. By combining expressive speech synthesis, high-quality audio, and multimodal input, GPT-realtime is expected to provide strong technical support for a wide range of applications, from advanced customer support systems to innovative assistive tools.

S2S GPT-realtime AzureAIFoundry Marin

This article is from AIbase Daily

Welcome to the [AI Daily] column! This is your daily guide to exploring the world of artificial intelligence. Every day, we present you with hot topics in the AI field, focusing on developers, helping you understand technical trends, and learning about innovative AI product applications.

—— Created by the AIbase Daily Team

AI News Recommendations

Another Breakthrough in Domestic Chips! 5nm Longying 2 Chip Officially Released, AI Computing Power Reaches 200TOPS

At the 2026 Beijing Auto Show, Xincheng Technology launched the 5nm automotive-grade AI cockpit chip Longying 2, with AI computing power reaching 200TOPS and supporting models with over 7B parameters, marking a key breakthrough in advanced process technology and cross-domain integration for high-end domestic vehicle chips.

Apr 27, 2026

100

Stop Using Old Instructions! OpenAI Releases the GPT-5.5 Prompt Guide: Simpler Is Better

OpenAI released GPT-5.5 prompt guidelines, urging developers to shift from lengthy instructions for older models to concise, result-oriented communication. Directly migrating old prompt stacks is counterproductive due to enhanced reasoning, making long instructions that compensated for model limitations now a performance bottleneck.....

Apr 27, 2026

150

Saying Goodbye to Flat Design: Google's Entire Application Icons Undergo a Major Transformation, Gradient Color Design Redefines Visual Aesthetics

Google plans a comprehensive visual upgrade for its core app icons, rolling out officially after a trial in late 2025. The new design replaces rigid pure circles and four-color patches with soft rounded corners and gradient transitions, moving from pastel tones to saturated Google primary colors for a more dynamic, modern look across nearly all core apps.....

Apr 27, 2026

140

OpenAI terminates the Codex product line and fully integrates it into GPT-5.5

OpenAI announced on April 26 that it has terminated the dedicated programming model Codex, integrating its core capabilities into the main GPT-5.5 model. Since GPT-5.4, the dedicated programming branch has disappeared, making GPT-5.3 the last standalone Codex model. This move marks OpenAI's return to a 'generalist' strategy, using a single powerful system to cover all specialized scenarios, including programming.

Apr 27, 2026

260

Hot Trend: Bay Area Mansion Supports Exchange of AI Giant's Shares, Seller Claims AI Configuration Is Too Low

Investment banker Storm Duncan listed a 13-acre mansion in Mill Valley, San Francisco, and publicly requested that buyers pay the purchase price with shares of the AI unicorn Anthropic instead of cash. This move aims to balance asset allocation and highlight the value of tech equity in high-end transactions. Duncan even created a LinkedIn page for the property to attract attention from the technology and investment communities, showcasing the emerging trend of equity-for-property exchanges.

Apr 27, 2026

120

Yinghe Yimei Collaborates with Beijing Tiantan Hospital to Launch the World's First Comprehensive Cranial CT Auxiliary Report Large Model

Beijing Tiantan Hospital and Yinghe Yimai jointly released 'Dr. Xiaojun 2.0', the world's first full-disease-coverage CT-assisted report generation model for cranial scans. Leveraging Tiantan's massive cranial CT data and Yinghe's foundation model with AI Agent technology, it automates the entire process from image analysis to diagnostic reporting, significantly enhancing neuroimaging diagnostic standards.....

Apr 24, 2026

320

The World's First Large Model for Full-Disease Coverage in Cranial CT Auxiliary Report Generation is Launched!

Yinghe Yimei and Beijing Tiantan Hospital jointly launched the world's first large model for full-disease coverage in cranial CT auxiliary report generation, "Xiao Jun Doctor 2.0", on April 24 in Beijing. This AI product aims to improve the efficiency and accuracy of medical imaging reports through advanced technology, attracting widespread attention from medical professionals and tech enthusiasts.

Apr 24, 2026

300

Cohere and Aleph Alpha Establish a $2 Billion Transatlantic Artificial Intelligence Partnership

Canadian startup Cohere and German startup Aleph Alpha have formed a $20 billion partnership to develop a 'sovereign' AI system, aiming to create an AI architecture independent of the US and China, advancing transatlantic technological autonomy. Cohere specializes in natural language processing, while Aleph Alpha excels in reasoning models; together, they will combine their technological strengths to accelerate independent AI development.....

Apr 24, 2026

290

Hong Kong Stock Market's Large Model Stocks Plunge! Zhipu and Minimax Suffer Heavy Losses After Deepseek V4 Release

In the Hong Kong stock market, shares of Zhipu Technology and Minimax fell significantly after the release of Deepseek V4, a highly anticipated deep learning model with technical upgrades and enhanced features. This unexpected downturn in these major AI concept stocks sparked widespread investor discussion.....

Apr 24, 2026

300

Perplexity CEO Says AI Trend Will Strengthen Rather Than Replace the iPhone's Core Position

Perplexity CEO Aravind Srinivas argues AI won't disrupt smartphones but will evolve iPhones into 'digital passports'. As AI relies on context, iPhones storing personal data (payments, health, communication) become critical infrastructure, with Apple's chips as an underestimated advantage.....

Apr 24, 2026

280

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Ranking Optimization

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

Microsoft officially launches GPT-realtime model, focusing on more realistic voice and multimodal input

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Another Breakthrough in Domestic Chips! 5nm Longying 2 Chip Officially Released, AI Computing Power Reaches 200TOPS

Stop Using Old Instructions! OpenAI Releases the GPT-5.5 Prompt Guide: Simpler Is Better

Saying Goodbye to Flat Design: Google's Entire Application Icons Undergo a Major Transformation, Gradient Color Design Redefines Visual Aesthetics

OpenAI terminates the Codex product line and fully integrates it into GPT-5.5

Hot Trend: Bay Area Mansion Supports Exchange of AI Giant's Shares, Seller Claims AI Configuration Is Too Low

Yinghe Yimei Collaborates with Beijing Tiantan Hospital to Launch the World's First Comprehensive Cranial CT Auxiliary Report Large Model

The World's First Large Model for Full-Disease Coverage in Cranial CT Auxiliary Report Generation is Launched!

Cohere and Aleph Alpha Establish a $2 Billion Transatlantic Artificial Intelligence Partnership

Hong Kong Stock Market's Large Model Stocks Plunge! Zhipu and Minimax Suffer Heavy Losses After Deepseek V4 Release

Perplexity CEO Says AI Trend Will Strengthen Rather Than Replace the iPhone's Core Position

AI News Recommendations

Another Breakthrough in Domestic Chips! 5nm Longying 2 Chip Officially Released, AI Computing Power Reaches 200TOPS

Stop Using Old Instructions! OpenAI Releases the GPT-5.5 Prompt Guide: Simpler Is Better

Saying Goodbye to Flat Design: Google's Entire Application Icons Undergo a Major Transformation, Gradient Color Design Redefines Visual Aesthetics

OpenAI terminates the Codex product line and fully integrates it into GPT-5.5

Hot Trend: Bay Area Mansion Supports Exchange of AI Giant's Shares, Seller Claims AI Configuration Is Too Low

Yinghe Yimei Collaborates with Beijing Tiantan Hospital to Launch the World's First Comprehensive Cranial CT Auxiliary Report Large Model

The World's First Large Model for Full-Disease Coverage in Cranial CT Auxiliary Report Generation is Launched!

Cohere and Aleph Alpha Establish a $2 Billion Transatlantic Artificial Intelligence Partnership

Hong Kong Stock Market's Large Model Stocks Plunge! Zhipu and Minimax Suffer Heavy Losses After Deepseek V4 Release

Perplexity CEO Says AI Trend Will Strengthen Rather Than Replace the iPhone's Core Position