Hacker News new | ask | show | jobs
by iplaman 3377 days ago
Thank you this is very interesting. I wonder about your initial train config, wouldn't it be more efficient with time in mind for demo purposes, to use more wav samples with less epochs?
1 comments

I agree that it would be more efficient to have more wav files in the github repo, but we kept them minimal to reduce the total file size when cloning the repository. You can find more of the Librispeech data here: http://www.openslr.org/12/

We kept the epochs at 100 to demonstrate the negative consequence of overfitting training data, when doing the test or dev set evaluations. We could probably reduce that to ~50 though to save time in training :)