Hacker News
by suprjami 274 days ago
Depending on what you want to do, you already can.

llama.cpp and other inference servers work fine on the kernel driver.

1 comment

Where "fine" unfortunately still means "don't push it too hard on a busy desktop system or your graphical session might crash". Make sure to keep enough RAM free, or you'll start seeing GPU resets; the stack can't cope with transient errors :-(
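
For what it's worth, a rough pre-flight check along these lines can be run before starting the inference server. This is only a sketch: it assumes a Linux box where `/proc/meminfo` reports `MemAvailable`, the function names are made up for illustration, and the 4 GiB reserve is an arbitrary placeholder, not a number anyone in this thread has measured.

```python
def mem_available_kib(meminfo_text):
    """Return the MemAvailable value (in KiB) from /proc/meminfo-style text."""
    for line in meminfo_text.splitlines():
        if line.startswith("MemAvailable:"):
            # Line looks like "MemAvailable:   16777216 kB"
            return int(line.split()[1])
    raise ValueError("MemAvailable not found")


def has_headroom(meminfo_text, needed_gib, reserve_gib=4.0):
    """True if available RAM covers the workload plus a reserve (both in GiB).

    reserve_gib is a guess at what the desktop session needs; tune it.
    """
    available_gib = mem_available_kib(meminfo_text) / (1024 * 1024)
    return available_gib >= needed_gib + reserve_gib
```

To use it, read the file with `open("/proc/meminfo").read()`, pass the text to `has_headroom` together with your own estimate of what the model needs, and only launch if it returns `True`. It won't prevent resets, it just makes it harder to start a big job on an already-full machine.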