Hacker News new | ask | show | jobs
by mandiantBob 1992 days ago
This is not correct. The security page only shows that they use Google for Speech to Text.

Instead, Text to Speech is done using technology developed by Lyrebird.ai, which Descript has bought over. Descript rebranded it as Overdub. Note that style transfer learning of voices is a hard problem and Overdub seems to have nailed it perfectly. I speculate that the underlying technology of Overdub is based on sv2tts (https://arxiv.org/abs/1806.04558).

The closest comparision to Overdub would be https://www.resemble.ai/.