Hacker News new | ask | show | jobs
by zozbot234 23 days ago
That's also a game changer for local inference. It unlocks long contexts, batched inference and storing the KV cache to disk on ordinary consumer platforms.