Hacker News
by
SlavikCA
56 days ago
I'm running it on my Intel Xeon W5 with 256GB of DDR5 and 72GB of Nvidia VRAM. I paid $7-8k for this system. It would probably cost twice as much now.
Using UD-IQ4_NL quants.
Getting 13 t/s. Using it with thinking disabled.
1 comment
GrayShade
56 days ago
I get 20 t/s on the UD-Q6_K_XL quant with a Radeon 6800 XT.