Hacker News new | ask | show | jobs
by dr_pardee 1170 days ago
While I am aware that it is possible to use software to clone someone's voice, I am uncertain if it is currently feasible to utilize the cloned voice in real-time situations.

I am curious if these scammers are pre-recording conversations and playing them back in real-time or if they are actually able to speak using the cloned voice technology.

5 comments

> feasible to utilize the cloned voice in real-time situations

Yes, it has been so for years. The attack to the Dubai Bank in Hong Kong was effected in January 2020.

https://news.ycombinator.com/item?id=29711876

From the article https://www.unite.ai/deepfaked-voice-enabled-35-million-bank...

"Though users at the Audio Fakes Discord and the DeepFaceLab Discord are intensely interested in combining the two technologies into a single video+voice live deepfake architecture, no such product has publicly emerged as yet."

> no such product has publicly emerged as yet

I am not aware of what was used in that occasion; there is also a good chance that the scammers could hack a system internally - the stake is stellar.

Surely, to convince the other part during a phone call a high degree of capability for interaction is required; from the same article, right above your quote:

> we can reasonably assume that the speaker is using a live, real-time deepfake framework

The quoted article is from late 2021. In these current days, we are seeing denounces according to which

> Voice scams are on the rise in a dramatic way. They were the second most common con trick in America last year, and they’re headed for the top spot this year (ex Voice scams: a great reason to be fearful of AI at https://news.ycombinator.com/item?id=35459261 )

The video's shown on YouTube have a number of "plausible" common responses pre-created, that can be played back by pushing a button.

In operation in the video's it seems to work pretty well. :(

If it's not now, it will be soon.
near real time is possible, there's lots of "v-tubers" doing it