Hacker News new | ask | show | jobs
by georgehotz 640 days ago
This sounds like prejudice. Have you benchmarked it?
1 comments

Yes I literally duplicated your approach for my driver stack last week and surprise surprise the FFI overhead into libc is too high.
FFI? This isn't how GPUs work...they are MMIO (mostly)

Those drivers are faster than anything else when used to run fixed command queues (what neural network runs are)