Hacker News new | ask | show | jobs
by fho 1988 days ago
I guess the simplest approach to port NNUE to GPUs would be to run a complete instance per GPU thread (ie concurrent, not parallel evaluation).