|
|
|
|
|
by stev-dl
1058 days ago
|
|
Looks like they're adding in quite a few "multimodal" features in GPT-5. Emphasis on audio: artificial speech production, audio-to-text, voice recognition - likely building on Whisper. Translation for text/speech also seems on the roadmap. |
|