Hacker News new | ask | show | jobs
by brucethemoose2 1095 days ago
There have actually been some papers on "better than SOTA" TTS speech models with shockingly good inflection, emotion, voice imitation and such.

But the orgs behind them say they are hesitant to release them due to obvious misuse concerns. And I think the unspoken concern is that the datasets are not clean.