Hacker News new | ask | show | jobs
by zneveu 2882 days ago
Had an idea to do this a couple months ago, but haven't got around to implementing it yet. I'm curious: did you consider using standard image processing techniques with spectrograms as an alternative to decision trees? I know thats how Izotope does their Neutron instrument detection, but I'm not sure how it would compare performance wise. Also, have you tried classifying percussive sounds that aren't actual drums? I'd love to see how it categorizes various stuff.
1 comments

Hey! In order:

- I did consider using image processing techniques as opposed to decision trees, but the point here was not to come up with the most advanced and accurate classifier possible, but rather to build something simple and explainable to folks without an ML (or even a CA) background.

- I haven't tried this extensively on non-drum-like percussion, but that'd be a great follow up post.

I totally understand your decision, but I bet a 1D deep convolutional network would do really well at this given a larger dataset. You can also do a lot of data augmentation by speed changes, filters, adding reverb, etc.