| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by oofbey 91 days ago
	Most everything starts as PyTorch. (Or maybe Jax.) But the inference engines all use hand tuned CUDA kernels - at least the good ones do. You have to do that to optimize things.

1 comments

overfeed 91 days ago

I'm certain inference engines don't use hand-tuned CUDA on Radeon or Mac Mini chips. My statement holds: those engines have no strict dependency on CUDA, or they'd be Nvidia-only.

link