Hacker News new | ask | show | jobs
by snickell 499 days ago
M2 max with 64GB: 14 tokens/s running `ollama run mistral-small:24b --verbose`