by simonw 590 days ago
With MLX:

    Prompt: 49 tokens, 95.691 tokens-per-sec
    Generation: 723 tokens, 10.016 tokens-per-sec
    Peak memory: 32.685 GB

so quite usable, thanks!