Y
Hacker News
new
|
ask
|
show
|
jobs
by
avaer
22 hours ago
Note this can run locally on a gaming card with quant. I got it running on a 4090 (24GB) 150 t/s with a Q4_K_M.