Y
Hacker News
new
|
ask
|
show
|
jobs
by
zerop
2346 days ago
Do these free audio books collection also has transcripts? I want to use them to train my Speech to text model.
2 comments
wiml
2346 days ago
If you don't already know about it, you'll probably be interested in Mozilla's Common Voice data:
https://voice.mozilla.org/en/datasets
link
jszymborski
2346 days ago
The books that are read are typically available in their original textual form at places like Project Gutenberg which curate texts in the public domain.
As for timestamps, I'm not aware of anything other than the chapter markers.
link