| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by conradev 253 days ago
	They’re already optimizing GPU die area for LLM inference over other pursuits: the FP64 units in the latest Blackwell GPUs were greatly reduced and FP4 was added