Hacker News
by daemonologist 473 days ago
It needs about 22 GB of memory after 4-bit AWQ quantization, so top-end consumer cards with at least 24 GB of VRAM, like Nvidia's RTX 3090 through 5090 or AMD's 7900 XTX, will run it.
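
To make the margin concrete, here's a rough sketch of the fit check. The 22 GB figure is the one quoted above; the VRAM numbers are the cards' published specs, and `headroom_gb` is just a hypothetical helper name for illustration.

```python
# Memory requirement quoted in the comment (4-bit AWQ), in GB.
REQUIRED_GB = 22

# Published VRAM per card, in GB.
CARD_VRAM_GB = {
    "RTX 3090": 24,
    "RTX 4090": 24,
    "RTX 5090": 32,
    "RX 7900 XTX": 24,
}

def headroom_gb(vram_gb, required_gb=REQUIRED_GB):
    """Memory left after loading the model (negative means it does not fit)."""
    return vram_gb - required_gb

for card, vram in CARD_VRAM_GB.items():
    print(f"{card}: {headroom_gb(vram):+d} GB headroom")
```

On the 24 GB cards that leaves roughly 2 GB spare. Whether that is comfortable depends on whether the 22 GB estimate already covers the KV cache and activations, which isn't stated here.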