Hacker News new | ask | show | jobs
Mooncake: A KVCache-Centric Disaggregated Architecture for LLM Serving (github.com)
13 points by zinccat 726 days ago