| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by georgehotz 640 days ago

Umm, why not?

We wrote entire NVIDIA, AMD, and QCOM drivers in that style.

1 comments

Because it's slow duh

This sounds like prejudice. Have you benchmarked it?

Yes I literally duplicated your approach for my driver stack last week and surprise surprise the FFI overhead into libc is too high.

FFI? This isn't how GPUs work...they are MMIO (mostly)

Those drivers are faster than anything else when used to run fixed command queues (what neural network runs are)