Hacker News new | ask | show | jobs
by georgehotz 640 days ago
Umm, why not?

We wrote entire NVIDIA, AMD, and QCOM drivers in that style.

https://github.com/tinygrad/tinygrad/blob/master/tinygrad/ru...

https://github.com/tinygrad/tinygrad/blob/master/tinygrad/ru...

https://github.com/tinygrad/tinygrad/blob/master/tinygrad/ru...

1 comments

Because it's slow duh
This sounds like prejudice. Have you benchmarked it?
Yes I literally duplicated your approach for my driver stack last week and surprise surprise the FFI overhead into libc is too high.
FFI? This isn't how GPUs work...they are MMIO (mostly)

Those drivers are faster than anything else when used to run fixed command queues (what neural network runs are)