Hacker News new | ask | show | jobs
by buildbot 15 days ago
Not that it super matters, but random hadamards for quantization have been a thing since way before turboquant.

https://arxiv.org/abs/2404.00456

1 comments

Which llama.cpp now does.