Hacker News new | ask | show | jobs
by pjmlp 1032 days ago
> This new ability to directly read or write to the full application memory address space will significantly improve programmer productivity for all programming models built on top of CUDA: CUDA C++, Fortran, standard parallelism in Python, ISO C++, ISO Fortran, OpenACC, OpenMP, and many others.

This is the part of CUDA alternatives always miss when their models only support C and some C++ subset.

1 comments

Apple Metal does this though.
Only recently. mmap file and directly use in Metal kernels is actually not supported until iOS 16 / macOS 13. Also, there are limited optimization opportunities around that and the recommended way seems still to use the specific Metal APIs to stream load assets from disk.
Although Apple holds a <10% of total market share when it comes to computers, so not sure how helpful it is.
Well Nvidia holds like 90% of the GPU marketshare, so any reply mentioning a competitor would have this property.
You missed the polyglot description regarding which workloads CUDA supports.