Hacker News new | ask | show | jobs
by telotortium 1292 days ago
What seems most likely is that OpenAI and other LLM trainers are going to proceed to training on transcripts of YouTube videos and podcasts using the Whisper text-to-speech model, which at its largest sizes is really quite state-of-the-art. For now, it seems like most of this content is still organic (or if it's not, the computer-generated speech is relatively easy to distinguish for now).