Hacker News
LLM Microserving: a new RISC-style approach to design LLM serving API (blog.mlc.ai)
4 points by jinhongyii 535 days ago
1 comment

Scale LLM serving with programmable cross-engine serving patterns, all in a few lines of Python