|
|
|
|
|
by infinite8s
1761 days ago
|
|
Yes agreed, this is great! The best I found that could generate faster than real-time without a GPU was speedyspeech (https://github.com/janvainer/speedyspeech). Unfortunately it was only trained using the LJSpeech dataset and I haven't been able to transfer to a multi-voice model. I have been using it to build an story-telling app for my kids. |
|
Oh, that's cool! :) Has some overlap with part of my interest in TTS technologies.
The existence of 50 voices for Larynx is definitely a significant part of what makes it an exciting development in this sphere of use.