而这次的GPT-Realtime-1.5,更是把这种体验推向了极致。它变得更加懂人话了,甚至能听出你语气里的急躁或者犹豫,那种机器味淡了很多,感觉就像是在跟一个真人打电话。 说到实际应用,这次更新最大的受益者就是语音智能体。你可以用它来开发智能客服、语音助手,甚至是电话销售机器人。
在模型方面,全新的实时模型gpt-realtime-1.5及其配套音频模型已正式发布。它们的核心目标是提高语音指令的可靠性。根据OpenAI的内部测试数据,新模型在数字和字母的转录准确率方面提高了约10%,逻辑音频任务的准确率提高了5%,指令执行的准确率也提高了7%,有效解决了AI在听取关键短语或执行复杂语音指令时出现偏差的问题。
OpenAI has expanded the availability of its GPT-5.3-Codex model to third-party developers via API and Microsoft Foundry.
Agora's Conversational AI Engine offers key enhancements to the Realtime API for more natural communication and interaction. This milestone builds on Agora's partnership with OpenAI, as the Realtime ...
Integration of OpenAI with Twilio’s Communications APIs Will Enable Over 300,000 Customers and more than 10 Million Developers to Create Compelling Voice Experiences SAN FRANCISCO--(BUSINESS ...
OpenAI has launched gpt-realtime, its latest speech-to-speech model, offering higher accuracy, improved instruction-following, and more natural-sounding voices. Back in October 2024, OpenAI announced ...
OpenAI and Microsoft Corp. today introduced two artificial intelligence models optimized to generate speech. OpenAI’s new algorithm, gpt-realtime, is described as its most capable voice model. The AI ...
OpenAI has announced new updates to its Codex and its Agent API, enhancing accessibility, functionality, and safety for developers. These updates include expanded access to Codex, new internet-enabled ...
OpenAI’s new GPT-Realtime model and Realtime API updates bring lifelike voice AI, phone calling, and image input to everyday apps. If you’ve ever wished that talking to an AI felt more like chatting ...
OpenAI has unveiled its latest speech-to-speech artificial intelligence (AI) model, gpt-realtime, designed to generate more vivid and natural voice interactions for real-time applications. Alongside ...
That's why OpenAI's push to own the developer ecosystem end-to-end matters in26. "End-to-end" here doesn't mean only better models. It means the ...
一些您可能无法访问的结果已被隐去。
显示无法访问的结果