Hacker News new | ask | show | jobs
by lostmsu 81 days ago
In large providers KV caches are the main bottleneck, no?