	by yunohn 241 days ago
	Yeah I’m honestly unclear on Nvidia’s thinking here - inference speed is unbelievably slow for the price. Given the extreme advantage they have with CUDA and the whole AI/ML ecosystem, barely matching Apple’s M-ultra speeds is a choice…

1 comment

Definitely a choice to give it low memory bandwidth. Probably to avoid customers thinking it can replace any data center GPU for inference use-cases.
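
For anyone wondering why memory bandwidth matters this much for inference, here is a rough back-of-envelope sketch. It assumes single-stream decoding where every generated token requires reading all model weights from memory once, so bandwidth divided by model size gives an upper bound on tokens per second. The function name and all the numbers are hypothetical placeholders for illustration, not specs of any real device.

```python
def decode_tokens_per_second(bandwidth_gb_s: float,
                             params_billions: float,
                             bytes_per_param: float) -> float:
    """Rough upper bound on decode speed for a bandwidth-bound model.

    Assumes each generated token reads every weight from memory once
    and ignores compute, KV-cache traffic, and batching.
    """
    # 1e9 parameters * bytes per parameter = size in GB
    model_size_gb = params_billions * bytes_per_param
    return bandwidth_gb_s / model_size_gb


# Hypothetical example: a 70B-parameter model quantized to 4 bits
# (0.5 bytes per parameter) on two made-up bandwidth figures.
low_bw = decode_tokens_per_second(250, 70, 0.5)
high_bw = decode_tokens_per_second(1000, 70, 0.5)
print(f"250 GB/s:  {low_bw:.1f} tokens/s upper bound")
print(f"1000 GB/s: {high_bw:.1f} tokens/s upper bound")
```

Under these assumptions the bound scales linearly with bandwidth, so quadrupling bandwidth quadruples the ceiling regardless of how much compute sits behind it.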