HN Mirror



	by tmostak 736 days ago
	This assumes that you can linearly scale up the number of TPUs to get equal performance to Nvidia cards for less cost. Like most things distributed, this is unlikely to be the case.

1 comments

logicchains 736 days ago

This is absolutely the case; TPUs scale very well: https://github.com/google/maxtext


pama 736 days ago

The repo mentions a Karpathy tweet from Jan 2023. Andrej has recently created llm.c, and the same model trained about 32x faster on the same Nvidia hardware mentioned in the tweet. I don't think the performance estimate that the repo used (based on that early tweet) was accurate for the performance of the Nvidia hardware itself.
