Hacker News new | ask | show | jobs
by AlyssaRowan 712 days ago
They're all actually AI powered, generally some form of real-time RNN trained on identifying and isolating voice content from background noise or music.

rnnoise2 is an open-source model that does very well. There also are things like Waves Clarity VX, the Nvidia Broadcast (Audio Effects SDK) too, as well as plenty of other solutions like Supertone Clear, Krisp, etc etc etc.

1 comments

Does that mean youtube is AI generating your voice to "add it back" after silencing that part of the video? Does it ever generate different words to what you actually said?