| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by BuildTheRobots 176 days ago
	I don't suppose you could point to any resources on where I could get started. I have a M2 with 64gb of unified memory and it'd be nice to make it work rather than burning Github credits.

2 comments

EagnaIonat 176 days ago

https://ollama.com

Although I'm starting to like LMStudio more, as it has more features that Ollama is missing.

https://lmstudio.ai

You can then get Claude to create the MCP server to talk to either. Then a CLAUDE.md that tells it to read the models you have downloaded, determine their use and when to offload. Claude will make all that for you as well.

link

shen 175 days ago

Which local models are you using for the 32gb MacBooks?

link

EagnaIonat 175 days ago

Mainly gpt-oss-20b as the thinking mode is really good. I occasionally use granite4 as it is a very fast model. But any 4GB model should easily be used.

link

eek2121 175 days ago

LM Studio is fantastic for playing with local models.

link