Hacker News new | ask | show | jobs
by petercooper 933 days ago
(I spent too much time to listen to talk radio in the 90s to understand this question at first. It's not about temporal reversal, but trying to reveal the original audio behind an AI enhanced/transcoded voice.)

I think so. There's a whole field of voice biometrics working in this area. I've experimented with such tools and you have to work hard to copy someone's vocabulary, timing, and cadence. If you speak or sing in your normal voice and convert it, there are huge tells, somewhat akin to those used in stylometry to identify the owners of sock accounts (indeed, if someone actually used TTS, it mostly becomes a stylometry problem, unless services like ElevenLabs were to add inaudible watermarks or something).