Hacker News new | ask | show | jobs
by tantony 1194 days ago
Stable Diffusion runs pretty fast on Apple Silicon. Not sure if that uses the GPU though.

I think one reason in this particular case may be the 4-bit quantization.

1 comments

Quantization is the answer here. CPU running the large models at 16 bits (which is actually 32, because CPUs mostly do not support FP16) would be really slow.