Hacker News new | ask | show | jobs
by swatcoder 702 days ago
I think it's hard to see the use case right now because the quality remains pretty dreadful.

But the prototypical legitimate use case (which we needn't be excited about), is a voice over artist leasing their timbre instead of their time so that new text can be made to sound like them without their being actively involved. If it were to become mature (which doesn't seem close, from this example), it would be a big step up from existing phone tree voice assemblage and would open the doors for dubbing, animation voiceover, harmonization, and ADR in commercial sound and film.

Gender masking or general anonymization aren't really served by this, as you don't need to adopt a specific target timbre to deliver on those. There are other techniques that work perfectly well for those uses, some that have already been around for ages.