Y
Hacker News
new
|
ask
|
show
|
jobs
by
paipa
846 days ago
Would be cool to see what happens if you quantize towards zero preferentially. Sparsifying the matrix should improve inference speed directly, right?