| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by AlyssaRowan 712 days ago
	They're all actually AI powered, generally some form of real-time RNN trained on identifying and isolating voice content from background noise or music. rnnoise2 is an open-source model that does very well. There also are things like Waves Clarity VX, the Nvidia Broadcast (Audio Effects SDK) too, as well as plenty of other solutions like Supertone Clear, Krisp, etc etc etc.

1 comments

CursedUrn 712 days ago

Does that mean youtube is AI generating your voice to "add it back" after silencing that part of the video? Does it ever generate different words to what you actually said?

link