Hacker News new | ask | show | jobs
by ekelsen 2402 days ago
TLDR; Make 1x1 convolutions sparse, write fast Sparse Matrix Multiplication kernels, get a nearly 2x speedup with smaller models.