by smcleod 38 days ago
Apple Silicon before the M4 does not have matmul instructions, which makes prompt processing very slow. It's quite different on the M5, which is much more like using an NVIDIA GPU.