OpenAI 发布 GPT-Realtime-2 语音模型,具备 GPT-5 级推理能力,可在对话中实时协作。同时推出流式模型 GPT-Realtime-Translate 和 GPT-Realtime-Whisper,扩展音频能力。
Introducing GPT-Realtime-2 in the API: our most intelligent voice model yet, bringing GPT-5-class reasoning to voice agents.
Voice agents are now real-time collaborators that can listen, reason, and solve complex problems as conversations unfold.
Now available in the API alongside streaming models GPT-Realtime-Translate and GPT-Realtime-Whisper — a new set of audio capabilities for the next generation of voice interfaces.
likes: 11317 | retweets: 1038 | replies: 500 | views: 1734987