OpenAI introduces new voice intelligence features in its API
The company announced new models that enable talking, transcribing, and translating conversations in real time.

OpenAI revealed this Thursday a series of voice intelligence features in its API, aimed at helping developers build apps that can converse, transcribe, and translate live conversations. The key addition is GPT‑Realtime‑2, a voice model designed to produce realistic vocal simulations capable of handling complex user requests with reasoning similar to GPT‑5.
They also launched GPT‑Realtime‑Translate, a tool offering instant translation across more than 70 input languages and 13 output languages, enabling seamless multilingual conversations. Additionally, GPT-Realtime-Whisper provides live speech-to-text capabilities, capturing interactions as they happen with high accuracy.
OpenAI explained that these models move beyond simple call-and-response interactions toward voice interfaces that can listen, reason, translate, transcribe, and take action during a conversation. The company highlighted potential uses in customer service, education, media, events, and creator platforms.
Of course, they acknowledged the risks of misuse and said they have built safeguards to prevent spam, fraud, or harmful content, including triggers to halt conversations that violate content guidelines.
The new features shift real-time audio from basic responses toward voice interfaces that can listen, reason, translate, transcribe, and act during conversations.
These advancements expand what’s possible with OpenAI’s Realtime API, opening new opportunities for interaction across various industries and applications involving AI-powered voice systems.
Article topics
Related articles

Google Launches Gemma 4 12B: Local AI for Your Laptop with 16GB RAM
Google's new artificial intelligence model aims to democratize access to generative AI, allowing it to run on average consumer computers.

Nvidia Challenges Intel and AMD with RTX Spark Superchip for PCs
Nvidia introduced RTX Spark, a processor promising to bring advanced artificial intelligence directly to your PC, without cloud dependence, and boost gaming to unprecedented levels on conventional machines.

Anthropic's Claude Opus 4.8 boosts "honesty" and reduces code flaws
Anthropic's new AI model, Claude Opus 4.8, launches this Thursday with a focus on transparency and error reduction, giving users more control over computational effort.
Latest news
View all
Stuntman Hollywood: Returns After 19 Years to PS5, Xbox Series, and PC
The iconic action and vehicular stunt franchise makes its comeback courtesy of Saber Interactive, promising a dose of nostalgia and adrenaline for the new generation.

NASA's Maven Mars Orbiter Declared Out of Service After Six Months of Silence
Following an anomaly that disrupted its orbit and depleted its batteries, the Maven spacecraft, vital for understanding Mars' atmosphere, has ended its active mission. Its scientific data remains an invaluable legacy.

Windows Drops NTLM: Microsoft Boosts Security with Kerberos
Microsoft is taking a crucial step to bolster security in Windows 11, announcing the deprecation of NTLM, its oldest authentication protocol, in favor of Kerberos.
Comments (0)
No comments yet. Be the first!
Leave a comment