Hacker News new | ask | show | jobs
by oynqr 508 days ago
Running the ollama 671b 4 bit quant on a 7950X3D with 128GiB RAM, I get like 1-2 t/s.