by atlex2 585 days ago
Seriously nobody thought to use SVD on these weight matrices before?
1 comment

I did try, but the wrong way: I applied SVD to the quantization error to recover quality, i.e. SVD(W - Q(W)). The lightbulb moment in this paper is to do the SVD on W itself first and then quantize what remains.
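
To make the difference in ordering concrete, here is a rough NumPy sketch of the two variants. It is only an illustration under assumptions: `quantize` is a hypothetical symmetric per-tensor uniform quantizer standing in for Q, and the rank and bit-width are arbitrary; the paper's actual quantizer and any extra steps it uses may well differ.

```python
import numpy as np

def quantize(w, bits=4):
    """Hypothetical stand-in for Q: symmetric per-tensor uniform quantization."""
    levels = 2 ** (bits - 1) - 1
    max_abs = np.max(np.abs(w))
    if max_abs == 0:
        return w.copy()
    scale = max_abs / levels
    # Round to the nearest grid point, then map back to floating point.
    return np.round(w / scale) * scale

def low_rank(w, rank):
    """Best rank-`rank` approximation of w via truncated SVD."""
    u, s, vt = np.linalg.svd(w, full_matrices=False)
    return (u[:, :rank] * s[:rank]) @ vt[:rank]

def svd_of_error(w, rank, bits=4):
    """What I tried: quantize W first, then low-rank-approximate W - Q(W)."""
    q = quantize(w, bits)
    return q + low_rank(w - q, rank)

def svd_then_quantize(w, rank, bits=4):
    """The ordering described above: low-rank part of W first, then quantize the remainder."""
    l = low_rank(w, rank)
    return l + quantize(w - l, bits)
```

Both functions return a full-precision reconstruction of W, so you can compare `np.linalg.norm(w - svd_of_error(w, r))` against `np.linalg.norm(w - svd_then_quantize(w, r))` on your own weight matrices. In a real setup the low-rank factors would be kept in higher precision and only the quantized part stored in low bits.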