Hacker News
by esafak 83 days ago
No, it is about compressing the KV cache; see "How TurboQuant works".