Hacker News new | ask | show | jobs
by anon373839 53 days ago
That's not what consumes the most memory at scale. The KV caches are per-user.