by bryanlarsen 582 days ago
Anybody spending tens of billions annually on Nvidia hardware is going to be willing to spend millions to port their software away from CUDA.
3 comments

First they need to support everything that CUDA is capable of in programming language portfolio, tooling and libraries.
A typical LLM might use about 0.1% of CUDA. That's all that would have to be ported to get that LLM to work.
Which misses the point of why CUDA has won.

Then again, maybe the goal is getting 0.1% of CUDA market share. /s

Nvidia has won because their compute drivers don't crash people's systems when they run e.g. Vulkan Compute.

You are mostly listing irrelevant nice to have things that aren't deal breakers. AMD's consumer GPUs have a long history of being abandoned a year or two after release.

CUDA C++, CUDA Fortran, CUDA Anything PTX, plus libraries, IDE integration, GPU graphical debugging, aren't just nice-to-have things.
In the words of Gilfoyle: I'll bite. Why has CUDA won?
CUDA C++, CUDA Fortran, CUDA Anything PTX, plus libraries, IDE integration, GPU graphical debugging.

Coupled with Khronos, Intel and AMD never delivering anything comparable with OpenCL, Apple losing interest after Khronos didn't take OpenCL in the direction they wanted, and Google never adopting it, favouring their Renderscript dialect.

For the average non-FAANG company, there's nothing to port to yet. We don't all have the luxury of custom TPUs.
To slower hardware? What are they supposed to port to, ASICs?
If the hardware is 30% slower and 2x cheaper, that's a pretty great deal.
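To put a number on that trade-off, here is a minimal sketch of the throughput-per-dollar arithmetic. The function name is made up for illustration, and it only looks at purchase price, ignoring power, cooling and porting cost.

```python
def perf_per_dollar_ratio(relative_speed: float, relative_price: float) -> float:
    """Throughput per dollar of the alternative, relative to the baseline GPU.

    relative_speed: alternative throughput / baseline throughput (0.7 = 30% slower)
    relative_price: alternative price / baseline price (0.5 = 2x cheaper)
    """
    return relative_speed / relative_price


# Figures from the comment above: 30% slower, 2x cheaper.
ratio = perf_per_dollar_ratio(relative_speed=0.7, relative_price=0.5)
print(f"throughput per dollar vs. baseline: {ratio:.2f}x")
```

Under those two figures alone, the cheaper hardware delivers about 1.4x the throughput per dollar spent on purchase.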
Power density tends to be the limiting factor for this stuff, not money. If it's 30 percent slower per watt, it's useless.
The ratio between power usage and GPU cost is very, very different than with CPUs, though. If you could save e.g. 20-30% of the purchase price that might make it worth it.

E.g. you could run an H100 at 100% utilization 24/7 for 1 year at $0.4 per kWh (so assuming significant overhead for infrastructure etc.) and that would only cost ~10% of the purchase price of the GPU itself.
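A rough sketch of that estimate, for anyone who wants to plug in their own numbers. The 700 W power draw and the $30,000 purchase price are assumptions, not figures from this thread; only the $0.4 per kWh rate and the 24/7 year come from the comment.

```python
# Assumed values (not from the thread): adjust to your own hardware and pricing.
GPU_POWER_KW = 0.7        # assumed draw of one H100 at full load, in kW
GPU_PRICE_USD = 30_000    # assumed purchase price of one H100

# Values taken from the comment above.
PRICE_PER_KWH_USD = 0.40  # deliberately inflated to cover infrastructure overhead
HOURS_PER_YEAR = 24 * 365


def yearly_energy_cost(power_kw: float, price_per_kwh: float,
                       hours: float = HOURS_PER_YEAR) -> float:
    """Cost in USD of running a constant load for the given number of hours."""
    return power_kw * hours * price_per_kwh


cost = yearly_energy_cost(GPU_POWER_KW, PRICE_PER_KWH_USD)
share = cost / GPU_PRICE_USD
print(f"energy: {cost:.0f} USD/year, {share:.1%} of the purchase price")
```

With these assumed inputs the yearly energy bill lands somewhat under 10% of the purchase price, in line with the ballpark above; a lower GPU price or a higher power draw shifts the ratio accordingly.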

The real cost of power usage isn't the electricity bill; it's capacity and cooling.
Yes, I know that. Hence I quadrupled the price of electricity. Or are you saying that the cost of capacity and cooling doesn't scale directly with power usage?

We can increase that another 2x and the cost would still be relatively low compared to the price/depreciation of the GPU itself.