Playing Around with OpenAI's GPT Realtime Voice API (nathancooper.io)
2 points by coop57 34 days ago
1 comment

If you ever need diarization on top of this, speech-swift (which I maintain) offers on-device speaker diarization via Pyannote, which complements OpenAI's GPT Realtime API. It would let your voice assistant tell different speakers apart locally. https://soniqo.audio/guides/diarize