by paipa 846 days ago
Would be cool to see what happens if you quantize towards zero preferentially. Sparsifying the matrix should improve inference speed directly, right?
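
A rough sketch of what I mean, in plain Python. Everything here is made up for illustration: the Gaussian "weights", the `scale` value, and the helper names are assumptions, not anything from a real quantization library. It compares round-to-nearest against truncation toward zero and counts how many entries land exactly on zero.

```python
import math
import random

def quantize_nearest(w, scale):
    # Baseline: round each scaled weight to the nearest integer level.
    return [round(x / scale) for x in w]

def quantize_toward_zero(w, scale):
    # Hypothetical variant: truncate toward zero, so every weight with
    # |x| < scale collapses to 0 (versus roughly |x| < scale / 2 above).
    return [math.trunc(x / scale) for x in w]

def sparsity(q):
    # Fraction of quantized entries that are exactly zero.
    return sum(1 for v in q if v == 0) / len(q)

random.seed(0)
# Assumption: toy weights drawn from a standard normal distribution.
weights = [random.gauss(0.0, 1.0) for _ in range(10_000)]
scale = 0.5  # assumed quantization step

q_nearest = quantize_nearest(weights, scale)
q_trunc = quantize_toward_zero(weights, scale)

print("zeros, round-to-nearest:", sparsity(q_nearest))
print("zeros, toward zero:     ", sparsity(q_trunc))
```

Truncation can only produce more zeros than rounding to nearest, at the cost of larger quantization error. Whether the extra zeros speed anything up probably depends on the kernel: a dense matmul still multiplies by the zeros, so you'd need a sparse format or hardware that can skip them.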