Y
Hacker News
new
|
ask
|
show
|
jobs
by
someguydave
53 days ago
I got about 7 tokens/sec generation on an M2 max macbook running 8-bit quant on an MLX version.