I can't distinguish most of them from human speech, even when I know ahead of time which is which:
English: https://r9y9.github.io/wavenet_vocoder/
English: https://google.github.io/tacotron/publications/speaker_adapt...
Japanese: https://r9y9.github.io/demos/projects/icassp2020/