by gloosx 481 days ago
Impressive, but I think this is missing two important things to not sound robotic: some atmosphere and space. During a real conversation, both partners are in some kind of space, whether a room, a park, a car, or just on foot in the street. So the voice needs a little reverb matching the space it is located in, and there should be some background noise from that same space. Even lip movement produces tiny background noises when you speak, which contribute to making the sound real.

Which is... annoying in voice interactions on the web. I purposefully set up my mic to avoid any echo and sound pretty direct like a radio host. Adding a simulated environment is less of a problem than getting a good baseline.
I think every microphone gives the recorded voice some characteristic atmosphere and space, so it's kind of a part of the sound baseline. It's only annoying when there is too much, but when it's right at the edge of perception it adds that naturalness to the sound. You can reduce it to a minimum of course, but you cannot completely eliminate it. That slight room tone or mic signature kind of glues everything together, making it feel more real.
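
For anyone who wants to hear the difference, the idea in this thread (a dry voice plus a barely perceptible reverb tail and noise floor) can be roughly sketched with NumPy. This is only an illustration under stated assumptions: the function name `add_room_tone` and the parameters `rt60`, `wet`, and `noise_db` are made up here, the "room" is a synthetic impulse response built from exponentially decaying noise rather than a measured one, and the background is plain white noise rather than a real recording of a space.

```python
import numpy as np

def add_room_tone(dry, sample_rate=16000, rt60=0.3, wet=0.1, noise_db=-50.0, seed=0):
    """Mix a dry mono signal with a faint synthetic reverb tail and a noise floor.

    All parameter names and defaults are illustrative assumptions.
    """
    rng = np.random.default_rng(seed)
    dry = np.asarray(dry, dtype=np.float64)

    # Synthetic impulse response: white noise whose amplitude decays by
    # 60 dB (a factor of 1000) over rt60 seconds. A crude stand-in for a
    # measured room impulse response.
    n_ir = int(rt60 * sample_rate)
    t = np.arange(n_ir) / sample_rate
    ir = rng.standard_normal(n_ir) * 10.0 ** (-3.0 * t / rt60)
    ir /= np.sqrt(np.sum(ir ** 2))  # normalize to unit energy

    # Convolve the voice with the impulse response, trimmed to input length.
    tail = np.convolve(dry, ir)[: len(dry)]

    # Low-level white noise standing in for room tone / mic self-noise.
    noise = rng.standard_normal(len(dry)) * 10.0 ** (noise_db / 20.0)

    # Mostly dry signal, with a small wet share and the noise floor on top.
    return (1.0 - wet) * dry + wet * tail + noise

# Usage: a one-second 220 Hz tone standing in for a voice recording.
sr = 16000
voice = 0.5 * np.sin(2 * np.pi * 220 * np.arange(sr) / sr)
processed = add_room_tone(voice, sample_rate=sr)
```

Keeping `wet` and `noise_db` low is the point made above: the effect should sit at the edge of perception, not be heard as an effect.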