Hacker News new | ask | show | jobs
by crowwork 533 days ago
Scale LLM serving with programmable cross-engine serving patterns, all in a few lines of Python