Y
Hacker News
new
|
ask
|
show
|
jobs
Faster Open-Source Llama3 Serving with SGLang Runtime (vs. TensorRT-LLM, VLLM)
(
lmsys.org
)
4 points
by
yvbbrjdr
697 days ago