HN Mirror


	by aliljet 36 days ago
	Where can a user reasonably host this in an affordable way to access the local LLM revolution?

4 comments

plagiarist 36 days ago

I think their Max models are far too big to fit on consumer hardware. For the smaller versions, people typically use Apple, AMD Halo, or dGPUs. Those are all varying degrees of "affordable."

satvikpendem 36 days ago

Unsloth Studio with its MTP support: https://unsloth.ai/docs/models/qwen3.6#mtp-guide

julianlam 36 days ago

Try llama.cpp and Qwen3.6-35B-A3B

Good balance of intelligence and speed.
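
A rough way to check whether a model like this fits on a given machine is to estimate the memory needed for the weights alone. The sketch below assumes the "35B" in the model name means about 35 billion total parameters, and the bits-per-weight values (16, 8, 4.5) are illustrative quantization levels, not figures from the thread. It ignores the KV cache and runtime overhead, so real usage will be higher.

```python
def weight_memory_gib(total_params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for model weights alone, in GiB.

    Excludes KV cache, activations, and runtime overhead.
    """
    total_bytes = total_params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


# Assumption: ~35 billion total parameters, read off the model name.
# All weights must be resident even if only a subset is active per token.
for bits in (16, 8, 4.5):
    print(f"{bits:>4} bits/weight: ~{weight_memory_gib(35, bits):.1f} GiB")
```

Comparing the result against available VRAM or unified memory gives a first-pass answer to the "affordable hardware" question before downloading anything.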

truetotosse 35 days ago

This one is not local