Hacker News new | ask | show | jobs
by brandall10 918 days ago
I get 40 tok/sec on my M3 Max on various 34B models, I gather a desktop 4090 would be at least 80?