OpenAI Upgrades AI Agent Development Tools to Support TypeScript and Improves Voice Interaction

AIbase基地

Published inAI News · 4 min read · Jun 4, 2025

OpenAI announced a series of significant upgrades to its AI agent development tools. These updates not only enhance the platform's compatibility but also optimize the voice interface while improving observability, allowing developers to build AI agents more efficiently.

OpenAI has added TypeScript support to its Agents SDK. This move enables developers in JavaScript and Node.js environments to participate in agent development. The new version maintains consistency with the previous Python version, featuring core components such as Handoffs (task handover mechanism), Guardrails (runtime behavior constraints), and Tracing (execution tracking). Additionally, the Model Context Protocol (MCP) ensures smooth transmission of context information during execution, enabling developers to seamlessly build agents in both frontend browsers and backend Node.js environments.

OpenAI introduced the RealtimeAgent feature to support low-latency voice applications. This feature integrates audio input/output, state interaction, and interruption handling functions, introducing a human-in-the-loop (HITL) approval mechanism. Developers can pause execution during agent operations, allowing the system to check the current status and continue execution only after manual confirmation. This mechanism is particularly suitable for scenarios requiring supervision and compliance checks, ensuring controllable agent behavior.

OpenAI also upgraded the Traces dashboard to track sessions for the Realtime API. The updated dashboard now covers audio input/output, tool calls, and user interruptions, providing unified audit records to simplify debugging and performance optimization processes.

OpenAI also improved the speech-to-speech model to reduce latency, enhance conversational naturalness, and improve interruption handling capabilities. After the update, the system can achieve faster streaming responses, more expressive audio generation, and robust handling of overlapping inputs, laying the foundation for building dynamic multimodal conversational agents.

Key Highlights:
🌟 TypeScript Support: OpenAI’s Agents SDK now supports TypeScript, expanding the developer ecosystem and making it easier for developers from different environments to use it.
🎤 RealtimeAgent Feature: This new feature supports low-latency voice applications, allowing developers to pause execution and manually confirm the agent's status during operation.
🔍 Speech Model Improvements: The speech-to-speech model was optimized to reduce latency, improve conversational naturalness, and enhance interruption handling capabilities.

Baidu's New Generation Digital Human Technology NOVA Makes Its Debut at WAIC, Expected to Be Opened in October

At the 2025 World Artificial Intelligence Conference (WAIC), Baidu presented multiple innovative achievements and latest developments in the field of artificial intelligence. During the conference, Baidu announced that its Apollo Go, PaddlePaddle Deep Learning Platform, and Baidu Intelligent Computing Cluster have been selected for the China Artificial Intelligence Industry Innovation Exhibition. Among them, Apollo Go not only exhibited as an item but also served as a shuttle vehicle for the conference, demonstrating the mature application of its autonomous driving technology. Currently, Apollo Go has provided over 11 million travel services worldwide, with safety driving mileage exceeding 1

2025 World Artificial Intelligence Conference Opens, Alibaba Unveils Its First Qwen AI Glasses

Today, the 2025 World Artificial Intelligence Conference (WAIC) officially kicked off, and Alibaba announced the technical progress of its first self-developed AI glasses - 'Qwen AI Glasses' at this event, and showcased a real machine on site. From the current display, the Qwen AI Glasses do not have a screen, but instead use a combination of lenses and voice interaction, which is considered one of the more suitable forms of AI carriers at present.

AI Daily: The Web Design Feature of Kousi Space is Now Available; Alibaba Wan 2.2 is About to Launch; OpenAI is About to Release GPT-5

AI updates: Coze Space launches 5-min web design; Alibaba's Qwen-MT supports 92 languages; ChatGPT agents now available; Alibaba Wan2.2 adds text-to-video; Anthropic releases AI audit tool; OpenAI plans GPT-5 in August; Google launches no-code Opal; PhysX-3D adds physics to 3D models; Kuaishou opens KAT-V1 model; iFlytek upgrades Spark X1.....

Figma Make Opens to All Users: AI-Powered Design, Efficiency Within Reach

Figma launches AI design tool 'Make' for all users, enabling natural language prototyping. Basic features are free; 'Full Seat' unlocks full access with unlimited AI credits. Supports image uploads for AI generation and offers editing tools. Integrates image generation and enhancement, forming a comprehensive design ecosystem.....

Product Finder

Product Submit

AI Models Finder

MCP Servers

MCP Client

MCP Inspector

Case Tutorials

Latest AI News

AI Daily Brief

OpenAI Upgrades AI Agent Development Tools to Support TypeScript and Improves Voice Interaction

AIbase基地

This article is from AIbase Daily

AI News Recommendations

Baidu's New Generation Digital Human Technology NOVA Makes Its Debut at WAIC, Expected to Be Opened in October

2025 World Artificial Intelligence Conference Opens, Alibaba Unveils Its First Qwen AI Glasses

Trickle Magic Canvas Launch: No-Code! Co-create Production-Grade Applications with AI, Revolutionizing the Future of Development!

AI Daily: The Web Design Feature of Kousi Space is Now Available; Alibaba Wan 2.2 is About to Launch; OpenAI is About to Release GPT-5

Tesla Emphasizes the Safety of Assisted Driving: AI Hardware Support

Memories AI Launches the World's First Artificial Intelligence Visual Memory Model and Secures $8 Million in Seed Funding

MyShell ShellAgent 2.0 Launch: Create an App with One Sentence, the AI Revolution Without Frontend is Coming

Zhejiang University Alumni Launch an AI Code Testing Tool, Create a Bug-Free Website in 30 Minutes

Figma Make Opens to All Users: AI-Powered Design, Efficiency Within Reach

Google Lab's Powerful New Product Opal: No-Code! Build AI Applications with Natural Language to Unlock Future Productivity