Hacker News new | ask | show | jobs
by zerop 2346 days ago
Do these free audio books collection also has transcripts? I want to use them to train my Speech to text model.
2 comments

If you don't already know about it, you'll probably be interested in Mozilla's Common Voice data: https://voice.mozilla.org/en/datasets
The books that are read are typically available in their original textual form at places like Project Gutenberg which curate texts in the public domain.

As for timestamps, I'm not aware of anything other than the chapter markers.