Hacker News
by CharlesW, 244 days ago
Actual title: "Solve the GPU Cost Crisis with kvcached: A library to enable virtualized, elastic KV cache for LLM serving on shared GPUs"
1 comment
dang, 243 days ago
Yes, we've put that in the title above (shortened to fit HN's 80 char limit). Submitted title was "Time to build a GPU OS? Here is the first step".