| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by nabakin 78 days ago
	If OP meant they have the fastest implementation of Gemma 4 on Blackwell at the moment, I guess that is technically true. I doubt that will hold up when TensorRT-LLM finishes their implementation though.

1 comments

How is the sglang performance on Blackwell for this model?

Dunno but there's a PR for it. Probably also more performant than Modular.