|
|
|
|
|
by shihab
483 days ago
|
|
For future readers, note that those 3x and 10x figures are compared to vLLM's own previous release, and NOT compared to Deepseek's implementation. I am very curious to see how well-optimized Deepseek's code is compared to leading LLM serving softwares like vLLM or SGLang. |
|