| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by modeless 1878 days ago
	You can do the linear parts of neural nets in the frequency domain, but AFAIK you can't do the nonlinearity, so you have to inverse transform back to the spatial domain for that. The nonlinearity is an absolutely essential part of pretty much every neural net layer, so there is no big win to be had unfortunately. For convolutional nets in particular there are other ways of going faster than a naive matrix multiply, e.g. winograd convolution.

2 comments

aesthesia 1878 days ago

But if your network contains a layer whose linear part performs an (approximate) DFT, you will get an efficiency gain by replacing it with an exact FFT.

You wouldn't want to use an FFT for most CNNs anyway because the kernels have very small support. Convolution with them is O(n) in the spatial domain as long as you recognize the sparsity.

link

evancox100 1878 days ago

Why can't you apply the nonlinear activation functions in the frequency domain? What's stopping this or making it not work?

link