Y
Hacker News
new
|
ask
|
show
|
jobs
by
conradev
253 days ago
They’re already optimizing GPU die area for LLM inference over other pursuits: the FP64 units in the latest Blackwell GPUs were greatly reduced and FP4 was added