Y
Hacker News
new
|
ask
|
show
|
jobs
by
tssge
59 days ago
The GPU has INT4, INT8, BF16 and FP16. Notably no FP8 or FP4.The official GPTQ-Int4 release from Qwen is a great quant for this but custom kernels are still rare for this hardware.
1 comments
moffkalast
59 days ago
Must be a case of the hardware being there and the software not actually supporting it then.
link