The writer is an AI researcher at Bramble Intelligence and worked on the State of AI Report 2025
Until recently, building an artificial intelligence system that could hold a convincing phone conversation was a laborious task. You had to combine separate tools for speech recognition, language processing and speech synthesis, all linked through fragile telephony software.
This is no longer true. The arrival of real-time, speech-native AI models such as OpenAI’s RealTime API, launched last year, means a system that once required multiple components can now be created in minutes.
您已阅读15%(580字),剩余85%(3215字)包含更多重要信息,订阅以继续探索完整内容,并享受更多专属服务。