Hacker News new | ask | show | jobs
by jazzyjackson 107 days ago
Tokens per second is abysmal no matter how much ram you have
1 comments

Some models run worse than others but I have gotten reasonable performance on my M4 Pro with 24 GB of RAM