Hacker News new | ask | show | jobs
by vochsel 629 days ago
They've really nailed the back and fourth of the two speakers!

It would be interesting to know if it's multimodal voice, or just clever prompting and recombining...

I added single voice podcasts to Magpai after seeing how useful this was. Allows for a bit more customisation of the podcast too https://www.youtube.com/watch?v=OEsh9MlbA6s

I've got a daily podcast of hackernews being generated here too: https://www.magpai.app/share/n7R91q

1 comments

It's almost certainly Google SoundStorm, a traditional TTS trained on dialogs from last year: https://x.com/jonathanfly/status/1675987073893904386