Hacker News new | ask | show | jobs
by veunes 82 days ago
Classic academic move. If the authors show accuracy-vs-space charts but hide end-to-end latency, it usually means their code is slower in practice than vanilla fp16 without any compression. Polar coordinates are absolute poison for parallel GPU compute
1 comments

I don't think they're using polar coordinates? They're quantizing to grid centroids.