Hacker News new | ask | show | jobs
by moondistance 361 days ago
Yes, big time, but there continues to be lots of progress.

Most importantly, models are maturing, and this means less custom optimization is required.

1 comments

Yes I'd agree with that. There is so much demand for inference which is maturing rapidly that even if a lot of the "R&D" is done on NVidia cards because of their (vastly, let's be fair) software stack, if AMD is competitive on the inference side (and perhaps more importantly have shorter lead times) then doing the inference on AMD is still an enormous market.

I suspect we will (or already are?) at a point where 95%+ of GPUs are used for inference, not training.