by daemonologist, 473 days ago
It needs about 22 GB of memory after 4-bit AWQ quantization, so top-end consumer cards like Nvidia's 3090 through 5090 or AMD's 7900 XTX will run it.
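A rough sanity check of that claim, as a sketch rather than a benchmark: compare the roughly 22 GB figure against each card's VRAM. The capacities below are the published specs; including the 4090 is an assumption that the "3090 - 5090" range covers it, and `headroom` is a made-up helper name. The 22 GB number is approximate, so the spare capacity shown is only indicative.

```python
# VRAM capacities in GB for the cards named above (published specs).
# The RTX 4090 is included on the assumption that the range covers it.
VRAM_GB = {
    "RTX 3090": 24,
    "RTX 4090": 24,
    "RTX 5090": 32,
    "RX 7900 XTX": 24,
}

REQUIRED_GB = 22  # approximate figure quoted in the comment


def headroom(required_gb, cards=VRAM_GB):
    """Return {card: spare GB} for every card with enough VRAM."""
    return {
        name: vram - required_gb
        for name, vram in cards.items()
        if vram >= required_gb
    }


if __name__ == "__main__":
    for name, spare in headroom(REQUIRED_GB).items():
        print(f"{name}: fits, about {spare} GB to spare")
```

On the 24 GB cards that leaves only about 2 GB of slack, so anything else using the GPU (a desktop session, for instance) could matter.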