Hacker News new | ask | show | jobs
by mark_l_watson 526 days ago
I am blown away: a year ago I bought a M2 32G Mac to run local models. It seems like what I can run locally now just one year later is 10x more useful for NLP, data wrangling, RAG, experimenting with agents, etc.

BTW, a few days ago I published a book on using Ollama. Here is a link to read it online https://leanpub.com/ollama/read

1 comments

Which models do you recommend for that amount of memory?
I asked the same question a few days back and I'm keeping the responses here: https://bsky.app/profile/potato.horse/post/3lejngewfmc2n
For reasoning: qwq:latest (19G file)

For coding: qwen2.5-coder:14b (9G file)

Misc. experiments, runs fast: llama3.2:latest )2 G file)