|
|
|
|
|
by amkkma
2096 days ago
|
|
The GPU gap is only if written in the high level index or loop style. There is little to no gap if done either using array abstractions (broadcast, map etc) or at a level similar to Cuda C (though with nicer Julia abstractions and syntax): https://juliagpu.org/cuda/ The Julialab at MIT is working on making the higher level codegen faster |
|