Hacker News
by stev-dl 1058 days ago
Looks like they're adding quite a few "multimodal" features in GPT-5. The emphasis is on audio: speech synthesis, speech-to-text, voice recognition - likely building on Whisper. Translation for both text and speech also seems to be on the roadmap.