| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by otabdeveloper4 961 days ago
	Using a full-blown GPU just for neural network inference is crazy inefficient. They should hire some blockchain dudes to build them custom hardware for one tenth of the price.

2 comments

otterley 961 days ago

NPUs already exist and are everywhere (Apple Silicon, AWS Inferentia). AWS is leading at scale in the cloud in this space.

(Disclaimer: I work for AWS, but opinions are my own.)

link

otterley 960 days ago

I failed to include Google TPUs. My apologies, GCP folks!

link

fragmede 960 days ago

Google hired a bunch of chip designers to make TPUs for their ML people, so are highly optimized for this kind of work, which are currently on their 5th gen, and are available on their cloud.

(Disclaimer: I used to work there but not on them.)

link