>"Most models are trained on the original speaker's voice, but maybe only a little bit."
Really cool that you got this to work. I used to work on TTS (a few years ago, now), and we trained on celebrity voices, but used full audiobooks. https://github.com/Kyubyong/tacotron
Thanks for making this so open and accessible.