Hacker News new | ask | show | jobs
by beatle_sauce 2275 days ago
IMHO the speech dataset list is missing other interesting free corpora, e.g. the TEDlium dataset, Voxforge, Common Voice. A more comprehensive (but not complete) list can be found here: https://github.com/kaldi-asr/kaldi/tree/master/egs (download links can be found in the scripts)