Hacker News
by m-p-3 1775 days ago
And the audio recordings are also curated by the volunteers, ensuring the audio snippets match the text, etc.

Which, it must be said, isn't always as bulletproof as it could be. There are a not-insignificant number of transcription (or pronunciation) errors in those datasets, and Mozilla might want to find ways to improve the quality of already-released data over time.