Hacker News
by jgalt212 2 hours ago
> as input tokens that are already in the KV cache are practically free for the provider,

Not at today's RAM prices.