Hacker News new | ask | show | jobs
Storage based KVCache for denser token factory (blogs.oracle.com)
1 points by baruch 37 days ago
1 comments

It is possible to get more tokens out of the same hardware by leveraging fast storage for KVCache, it is especially useful for agentic workloads.