OpenAI Launches Real-Time API: AI Voice Assistants Can Communicate Like Humans

AIbase基地

Published inAI News · 5 min read · Aug 29, 2025

OpenAI has officially launched its "Realtime API" for production use, marking an important step forward for the company in the field of voice interaction technology. This API is primarily aimed at companies and developers building voice assistants for practical applications such as customer support, education, or personal productivity. The core component is the new GPT-Realtime model. This model can generate and process voice directly, without the traditional text conversion steps, resulting in faster and more natural conversations.

Key Features and Significant Performance Improvements

The new GPT-Realtime model has achieved several technological breakthroughs. It can now capture and understand non-verbal cues such as laughter, switch between different languages smoothly within the same sentence, and adjust tone according to instructions, such as "speaking in a friendly French accent" or "quickly and professionally." In addition, the model introduces two new voices: Cedar and Marin, and optimizes existing voices, further enhancing the user experience.

In benchmark tests, GPT-Realtime performed well, achieving an accuracy rate of 82.8% on Big Bench Audio (higher than 65.6%), 30.5% on MultiChallenge (higher than 20.6%), and 66.5% on ComplexFuncBench (higher than 49.7%). These figures show that the new model has made significant progress in handling complex instructions and multilingual tasks.

OpenAI, ChatGPT, artificial intelligence, AI

Better Integration and Lower Prices

The new API simplifies tool integration, allowing the model to more reliably select and use the correct tools and parameters. Developers can now connect external services via SIP and remote MCP servers and use reusable prompts to save different configurations.

Additionally, the image input feature is now available. Users can send screenshots or photos during a conversation, and the model can reference and understand the content in the image, such as reading text or answering related questions. Developers can flexibly control the range of content the model can see.

For cost control, the new API allows developers to set token limits and streamline long sessions. Additionally, the price of GPT-Realtime has been reduced by 20%. Currently, the cost is $32 per million audio input tokens, $64 per million output tokens, and $0.40 per million cached input tokens.

Safety and Privacy: Protective Measures and User Choices

OpenAI emphasizes that this API can detect and terminate conversations that violate its policies, but also points out that developers should add additional security measures themselves. In terms of data privacy, OpenAI provides specific options allowing EU users to choose to store data within the EU, and has established special privacy rules for enterprise users to ensure data security and compliance.

AI Daily: AI Video Mystery Dark Horse Happy Horse Makes Its Debut; Aisi Technology PixVerse C1 Released; 360 Creates Xiaoshu APP

Welcome to the [AI Daily] column! This is your guide to exploring the world of artificial intelligence every day. Every day, we present you with the latest content in the AI field, focusing on developers, helping you understand technology trends and innovative AI product applications. Click to learn more about new AI products: https://app.aibase.com/zh1, Exceed Seedance 2.0! 8, Zhipei launches GLM-5.1: SWE-bench score leads globally, model price increases by 10%. Zhipei releases a new large model GLM-5

15 Seconds 1080P Synchronized Audio and Video! Aishi Technology PixVerse C1 Launch: High-End Model for the Film Industry Makes a Big Impact

Aishi Technology launches the PixVerse C1, a large model tailored for the film industry, aiming to reshape the film production process. The model supports the generation of up to 15-second 1080P high-definition videos, achieving a leap from single shots to automatic scene transitions. It is now available on the Web and API platforms.

OpenAI was reportedly considering imitating the villain from 'Call of Duty' to provoke conflicts between major powers in order to gain funding

OpenAI internal discussions once considered creating panic about an international AI arms race to encourage governments to invest heavily. Its president proposed using geopolitical tensions, employing a 'prisoner's dilemma' strategy to prompt countries to provide funding to avoid falling behind, contrary to suggestions aimed at preventing AI competitions.

Japanese Scientists Successfully Train Rat Neurons to Perform Real-Time AI Computing Tasks

Japanese researchers trained rat cortical neurons to generate complex temporal signals using real-time machine learning, advancing AI computing applications. Their 'closed-loop reservoir computing' system combines live neurons, microelectrode arrays, and microfluidics, demonstrating biological neurons' information processing potential.....

Latest AI News

AI Daily Brief

AI Product Finder

AI Product Rankings

AI Product Submit

AI Tools Directory

GEO Brand Visibility

AI Visibility Audit

AI Search Visibility Checker

GEO Promotion Link Detection

GEO Ranking Optimization System

GEO Services​

MCP Servers

MCP Client

MCP Case Tutorials

MCP Ranking

MCP Service Submission

MCP Playground

MCP Inspector

LLM API Hub

AI Models Finder

Model Providers

LLM Leaderboard

Compare LLMs

LLM Cost Calculator

LLM Arena

AI Model Compatibility Checker

AI Deployment Calculator

OpenAI Launches Real-Time API: AI Voice Assistants Can Communicate Like Humans

AIbase基地

Key Features and Significant Performance Improvements

Better Integration and Lower Prices

Safety and Privacy: Protective Measures and User Choices

This article is from AIbase Daily

AI News Recommendations

Musk Sues OpenAI, Demands Removal of CEO Altman

AI Daily: AI Video Mystery Dark Horse Happy Horse Makes Its Debut; Aisi Technology PixVerse C1 Released; 360 Creates Xiaoshu APP

The Mind-Reading Technology of Taobao Merchants Is Here! Wanmeng Technology's Magic Cube AI Quality Inspection VOC Enters the Service Marketplace

15 Seconds 1080P Synchronized Audio and Video! Aishi Technology PixVerse C1 Launch: High-End Model for the Film Industry Makes a Big Impact

Mother of GPT-4o Announces Resignation, OpenAI Leadership Faces Further Turmoil

MIIT Releases 'Interim Measures for the Ethical Review and Service of Artificial Intelligence Technology' to Clarify New Regulations on AI Ethical Governance

OpenAI Data Shows 600,000 Health Consultations Emerge Weekly in US Hospitals in Desert Areas

Bezos' New Lab Hires Former OpenAI Co-Founder

OpenAI was reportedly considering imitating the villain from 'Call of Duty' to provoke conflicts between major powers in order to gain funding

Japanese Scientists Successfully Train Rat Neurons to Perform Real-Time AI Computing Tasks

AI News Recommendations

Musk Sues OpenAI, Demands Removal of CEO Altman

AI Daily: AI Video Mystery Dark Horse Happy Horse Makes Its Debut; Aisi Technology PixVerse C1 Released; 360 Creates Xiaoshu APP

The Mind-Reading Technology of Taobao Merchants Is Here! Wanmeng Technology's Magic Cube AI Quality Inspection VOC Enters the Service Marketplace

15 Seconds 1080P Synchronized Audio and Video! Aishi Technology PixVerse C1 Launch: High-End Model for the Film Industry Makes a Big Impact

Mother of GPT-4o Announces Resignation, OpenAI Leadership Faces Further Turmoil

MIIT Releases 'Interim Measures for the Ethical Review and Service of Artificial Intelligence Technology' to Clarify New Regulations on AI Ethical Governance

OpenAI Data Shows 600,000 Health Consultations Emerge Weekly in US Hospitals in Desert Areas

Bezos' New Lab Hires Former OpenAI Co-Founder

OpenAI was reportedly considering imitating the villain from 'Call of Duty' to provoke conflicts between major powers in order to gain funding

Japanese Scientists Successfully Train Rat Neurons to Perform Real-Time AI Computing Tasks

GEO Services