Hacker News new | ask | show | jobs
A collection of reproducible LLM inference engine benchmarks: SGLang vs. vLLM (github.com)
1 points by zhwu 424 days ago