HN Mirror



	by infinite8s 1761 days ago
	Yes, agreed, this is great! The best I found that could generate faster than real time without a GPU was speedyspeech (https://github.com/janvainer/speedyspeech). Unfortunately, it was only trained on the LJSpeech dataset, and I haven't been able to transfer it to a multi-voice model. I have been using it to build a story-telling app for my kids.

1 comment

> been using it to build a story-telling app for my kids.

Oh, that's cool! :) Has some overlap with part of my interest in TTS technologies.

The existence of 50 voices for Larynx is definitely a significant part of what makes it an exciting development in this sphere of use.