Hacker News new | ask | show | jobs
by colonCapitalDee 462 days ago
Thankfully the ONNX runtime supports AMD's ROCm. The performance is nowhere close to Nvidia's TensorRT, but doing inference on an AMD GPU is doable. I haven't ran the numbers, but I wouldn't be surprised if you could get more throughput per dollar on an AMD GPU for at least some scenarios. AFAIK there aren't any great options for training though, so if you want to do anything besides running ONNX models your options are limited.