Hacker News
by aliljet
36 days ago
Where can a user host this affordably, so they can take part in the local LLM revolution?
4 comments
plagiarist
36 days ago
I think their Max models are far bigger than anything that fits on consumer hardware. People typically use Apple, AMD Halo, or discrete GPUs (dGPUs) if and when smaller versions are released. Those are all varying degrees of "affordable."
satvikpendem
36 days ago
Unsloth Studio with its MTP support:
https://unsloth.ai/docs/models/qwen3.6#mtp-guide
julianlam
36 days ago
Try llama.cpp with Qwen3.6-35B-A3B. It is a good balance of intelligence and speed.
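For anyone sizing hardware against a suggestion like this, here is a rough back-of-the-envelope sketch of the weight-memory arithmetic. It assumes the "35B" in the model name means roughly 35 billion total parameters, and that all of them must be resident in memory even if only a fraction is active per token. The function names, the 4-bit and 16-bit settings, and the 20% overhead allowance for context cache and runtime buffers are illustrative assumptions, not figures from llama.cpp or from the model's documentation.

```python
def weight_memory_gib(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the model weights alone, in GiB."""
    total_bytes = params_billion * 1e9 * bits_per_weight / 8
    return total_bytes / 2**30


def fits(params_billion: float, bits_per_weight: float,
         memory_gib: float, overhead_fraction: float = 0.2) -> bool:
    """Rough check: weights plus an assumed overhead margin vs. available memory.

    overhead_fraction is a hypothetical allowance, not a measured value.
    """
    needed = weight_memory_gib(params_billion, bits_per_weight) * (1 + overhead_fraction)
    return needed <= memory_gib


if __name__ == "__main__":
    # Assumed: ~35 billion total parameters, compared at two precisions.
    for bits in (4, 16):
        print(f"{bits}-bit weights: ~{weight_memory_gib(35, bits):.1f} GiB")
    for mem in (16, 24, 64):
        print(f"fits in {mem} GiB at 4-bit: {fits(35, 4, mem)}")
```

Under these assumptions, a 4-bit quantization needs on the order of 16 GiB for weights alone, so a 24 GiB device is plausible and a 16 GiB one is not. Real quantized files use mixed precisions, so actual sizes will differ somewhat.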
truetotosse
35 days ago
This one is not local.