Hacker News
by esafak 83 days ago
No, it is about compressing the KV cache; see How TurboQuant works.