| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by exabrial 66 days ago
	Why don't we have LLM-Accelerator cards (yet)? Just seems like graphics cards are a good choice, but not a purpose-built choice.

1 comments

cjbprime 66 days ago

Inference (not training) is bottlenecked by memory access speed, not compute. Having special hardware wouldn't make it faster unless you somehow found a faster memory controller than the GPU has.

link