Y
Hacker News
new
|
ask
|
show
|
jobs
by
baggiponte
314 days ago
Yeah. The docs tell you that you should build it yourself, but…
1 comments
tough
314 days ago
but unlike cuda there's no custom kernels for inference in vllm repo...
I think
link
I think