Hacker News new | ask | show | jobs
by gamblor956 529 days ago
This was a great way for them to collect a lot of free voice data to train their model.
1 comments

Every time there's a voice recognition post here someone comments about acquiring data. Why is this method better than having access to all of the video and podcasts sites on the internet?
You can get people to utter the same sentence.
Righto. Everyone is saying the same thing so it's the cleanest data set you can get.
https://www.youtube.com/watch?v=ksb3KD6DfSI

Just feed the AI broadcasts from "local" news stations.