| Backpropping filter coefficients is clever, but it hasn't really caught on much. Google also tried with LEAF (https://github.com/google-research/leaf-audio) to have a learnable audio filterbank. Anyway, in audio ML what is very common is: a) Futzing with the way you do feature extraction on the input. (Oh, maybe I want CQT for this task or a different scale Mel etc) b) Doing feature extraction on generated audio output, and constructing loss functions from generated audio features. So, as I said, I don't exactly see the utility of this library for deep learning. With that said, it is definitely nice to have really high speed low latency audio algorithms in C++. I just wouldn't market it as "useful for deep learning" because a) during training, you need more flexibility than non-GPU methods without backprop b) if you are doing "deep learning" then your inferred model will presumably be quite large, and there will be a million other things you'll need to optimize to get real-time inference or inference on CPUs to work well. Is just my gut reaction. It seems like a solid project, I just question the one selling point of "useful for deep learning" that's all. |