| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by pjmlp 1032 days ago
	> This new ability to directly read or write to the full application memory address space will significantly improve programmer productivity for all programming models built on top of CUDA: CUDA C++, Fortran, standard parallelism in Python, ISO C++, ISO Fortran, OpenACC, OpenMP, and many others. This is the part of CUDA alternatives always miss when their models only support C and some C++ subset.

1 comments

mratsim 1032 days ago

Apple Metal does this though.

link

liuliu 1032 days ago

Only recently. mmap file and directly use in Metal kernels is actually not supported until iOS 16 / macOS 13. Also, there are limited optimization opportunities around that and the recommended way seems still to use the specific Metal APIs to stream load assets from disk.

link

capableweb 1032 days ago

Although Apple holds a <10% of total market share when it comes to computers, so not sure how helpful it is.

link

Me1000 1032 days ago

Well Nvidia holds like 90% of the GPU marketshare, so any reply mentioning a competitor would have this property.

link

pjmlp 1032 days ago

You missed the polyglot description regarding which workloads CUDA supports.

link