Hacker News new | ask | show | jobs
by syntaxers 1259 days ago
On your first point, there's some really good results from this paper:

Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers https://arxiv.org/abs/2301.02111

Website with examples: https://valle-demo.github.io/

For your second question, Apple is already rolling out AI-narrated audiobooks. See: https://arstechnica.com/gadgets/2023/01/apple-rolls-out-ai-n...