Hacker News new | ask | show | jobs
by almostgotcaught 640 days ago
Because it's slow duh
1 comments

This sounds like prejudice. Have you benchmarked it?
Yes I literally duplicated your approach for my driver stack last week and surprise surprise the FFI overhead into libc is too high.
FFI? This isn't how GPUs work...they are MMIO (mostly)

Those drivers are faster than anything else when used to run fixed command queues (what neural network runs are)