| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by nmfisher 1989 days ago
	This is actually quite impressive too, significantly better than the last time I looked into Mozilla TTS. Roughly how much audio does "two novels" equate to?

2 comments

nmstoker 1989 days ago

Here's another sample with the same model+vocoder, this time reading from a Wikipedia article: https://m.soundcloud.com/user-726556259/q-learning-wavegrad-...

link

nmstoker 1989 days ago

It's about 32 hours of audio.

As some of the audio is read in different accents to the main accent used, ideally the different accent audio would have been removed. Doing so would be expected to help with voice quality, reducing the overall amount used and, as a bonus, cutting training time too.

link