Hacker News
by mekpro 320 days ago
I got 70 tokens/s on an M4 Max.
1 comment

That M4 Max is really something else; I also get 70 tokens/second on eval on an RTX 4000 SFF Ada server GPU.