Yes I'd agree with that. There is so much demand for inference which is maturing rapidly that even if a lot of the "R&D" is done on NVidia cards because of their (vastly, let's be fair) software stack, if AMD is competitive on the inference side (and perhaps more importantly have shorter lead times) then doing the inference on AMD is still an enormous market.
I suspect we will (or already are?) at a point where 95%+ of GPUs are used for inference, not training.
I suspect we will (or already are?) at a point where 95%+ of GPUs are used for inference, not training.