Hacker News new | ask | show | jobs
by wbl 555 days ago
The gradient information in backroom can be computed similarly to forwards I think. Certainly the FFT blocks are linear and so now it's a question about the multiplication which is pretty compact.