by woodson
3091 days ago
It's not just about the amount of data, though. The speech and audio quality of Google's TTS data is likely better than the audio contained in the LJ dataset (disclaimer: I've only listened to samples contained in the latter, which have some audible reverberation). Ideally, you'd use a professionally trained speaker and record them in a (semi-)anechoic chamber.