Hacker News new | ask | show | jobs
by visarga 73 days ago
There are ways to quantize or compress KV cache down.