Hacker News new | ask | show | jobs
by someguydave 53 days ago
I got about 7 tokens/sec generation on an M2 max macbook running 8-bit quant on an MLX version.