Hacker News new | ask | show | jobs
by fxtentacle 1406 days ago
For Romanian, I believe someone would first have to collect a large dataset of speech recordings together with groundtruth text. Even if you find cheap narrators working for $10 per hour, that's still $100k for 10k hours of data.