|
|
|
|
|
by telotortium
1292 days ago
|
|
What seems most likely is that OpenAI and other LLM trainers are going to proceed to training on transcripts of YouTube videos and podcasts using the Whisper text-to-speech model, which at its largest sizes is really quite state-of-the-art. For now, it seems like most of this content is still organic (or if it's not, the computer-generated speech is relatively easy to distinguish for now). |
|