HN Mirror



	by baggiponte 314 days ago
	Yeah. The docs tell you that you should build it yourself, but…

1 comment

but unlike CUDA, there are no custom kernels for inference in the vLLM repo...

I think