by ma2rten 1775 days ago
Why does data for speech recognition need to be perfect? That's certainly not the case for other machine learning applications. Can you train on the less clean data and fine-tune on a clean subset?

Well that was kind of my point: you need to manually figure out what’s clean and what isn’t.
But doing that for a small fine-tuning subset is easy compared to cleaning up the entire dataset.
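
To make the idea concrete, here is a toy sketch of the two-stage recipe: train on everything, noise included, then keep training from those weights on a small hand-checked subset. Everything in it is an assumption for illustration (a 1-D logistic regression standing in for a speech model, 20% randomly flipped labels standing in for "less clean" data, and the names `train`, `accuracy`, `noisy`, and `small_clean`), so it says nothing about how well this works for real speech recognition.

```python
import math
import random

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def train(weights, data, lr, epochs):
    """Plain SGD for 1-D logistic regression, starting from `weights`."""
    w, b = weights
    for _ in range(epochs):
        for x, y in data:
            err = sigmoid(w * x + b) - y  # gradient of log loss w.r.t. the logit
            w -= lr * err * x
            b -= lr * err
    return [w, b]

def accuracy(weights, data):
    w, b = weights
    hits = sum((sigmoid(w * x + b) > 0.5) == bool(y) for x, y in data)
    return hits / len(data)

rng = random.Random(0)

# Hypothetical toy task: the true label is 1 when x > 0.
clean = [(x, int(x > 0)) for x in (rng.uniform(-1, 1) for _ in range(1000))]
held_out = [(x, int(x > 0)) for x in (rng.uniform(-1, 1) for _ in range(200))]

# Stand-in for the uncleaned dataset: about 20% of labels flipped at random.
noisy = [(x, 1 - y) if rng.random() < 0.2 else (x, y) for x, y in clean]

# Stand-in for the small subset someone verified by hand.
small_clean = clean[:50]

# Stage 1: train on the large noisy set.
pretrained = train([0.0, 0.0], noisy, lr=0.1, epochs=20)

# Stage 2: continue from the pretrained weights on the clean subset,
# with a smaller learning rate.
finetuned = train(pretrained, small_clean, lr=0.05, epochs=50)

print("pretrained:", accuracy(pretrained, held_out))
print("fine-tuned:", accuracy(finetuned, held_out))
```

The only manual work here is labeling `small_clean`; the noisy set is used as-is.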