OpenAI said Thursday that its API will now include a number of new voice intelligence features designed to help developers create apps that can talk, transcribe, and translate conversations with users ...
OpenAI has introduced three new real-time voice models—GPT‑Realtime‑2 for reasoning, GPT‑Realtime‑Translate for live translation, and GPT‑Realtime‑Whisper for transcription—via its Realtime API. The ...
OpenAI is moving away from models that require heavy hand-holding and toward systems that can better infer the user’s goal, ...
GPT-Realtime-2 brings GPT-5-class reasoning to live voice. A separate translation model covers 70+ input languages. A streaming Whisper variant handles transcription. The pricing is aggressive enough ...
OpenAI released a new generation of voice models in its API on Wednesday, giving developers tools to build apps that can reason through spoken requests, translate across +70 languages, and transcribe ...
May 7 (Reuters) - OpenAI introduced three audio models for its developer platform on Thursday, aiming to make voice-based software agents more ‌conversational and capable of completing tasks in real ...
Voice agents have been expensive to run and painful to orchestrate, not because the models can't handle conversation, but because context ceilings forced enterprises to build session resets, state ...
Stackpack is the first platform to unify AI token consumption, model attribution, and per-employee usage across ...