| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by nmstoker 2032 days ago

There's Mozilla TTS https://github.com/mozilla/TTS

Here's a sample from a TTS model + vocoder I released for it. I've no wish to deter the motivated, but it'd take a bit of figuring out how to set things up and you'd need to read the docs and code to get oriented :)

https://m.soundcloud.com/user-726556259/sherlock-wavegrad-sa...

Links to the models are here: https://discourse.mozilla.org/t/creating-a-github-page-for-h...

Is originally trained on two novels read by the same narrator on LibriVox (ie in public domain)

2 comments

nmfisher 2031 days ago

This is actually quite impressive too, significantly better than the last time I looked into Mozilla TTS. Roughly how much audio does "two novels" equate to?

link

nmstoker 2031 days ago

Here's another sample with the same model+vocoder, this time reading from a Wikipedia article: https://m.soundcloud.com/user-726556259/q-learning-wavegrad-...

link

nmstoker 2031 days ago

It's about 32 hours of audio.

As some of the audio is read in different accents to the main accent used, ideally the different accent audio would have been removed. Doing so would be expected to help with voice quality, reducing the overall amount used and, as a bonus, cutting training time too.

link

terrycody 2032 days ago

Is there a simple interface like the example in this thread to use the tool for a non developer regard Mozila TTS yet? I can't find one...

link

nmstoker 2032 days ago

There's the demo server which has a simple web UI where you can input text to be spoken, but in regards to setting it up locally it's not that suited for a non developer

https://github.com/mozilla/TTS/tree/master/TTS/server

https://github.com/mozilla/TTS/wiki/Build-instructions-for-s...

There's also a version in docker: https://github.com/synesthesiam/docker-mozillatts

And various Colabs too, which are fairly easy to get going with: https://github.com/mozilla/TTS/wiki/TTS-Notebooks-and-Tutori...

link