Hacker News new | ask | show | jobs
by acacac 687 days ago
will the model ever be extended to being able to segment audio (eg. different people talking, different instruments in a soundtrack?)
3 comments

Check out Facebook DeMucs, and more newer: Ultimate Vocal Remover project on GitHub
There are a ton of models that do Stemming like this. We use them all the time. Lookup MvSep on Replicate.com
That would be really cool to try out. I hope someone is doing that.