Hacker News
by simonw 226 days ago
That really is the most interesting question for me: when will it be possible to run a model that is good enough to drive Claude Code or Codex CLI on consumer hardware?

gpt-oss-120b fits on a $4,000 NVIDIA DGX Spark and can be used by Codex CLI - it's OK, but still nowhere near the bigger models: https://til.simonwillison.net/llms/codex-spark-gpt-oss

But... MiniMax M2 benchmarks close to Sonnet 4 and has 230B parameters - too big for one Spark, but it can run on a $10,000 Mac Studio.

And Kimi K2 runs on two Mac Studios ($20,000).

So we are getting closer.
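
For anyone who wants to check the "fits on X" claims, here is a rough back-of-envelope sketch. Everything in it is my assumption rather than a measured figure: roughly 4-bit quantized weights, a flat 20% overhead for KV cache and runtime, 128 GB of unified memory on a Spark, 512 GB on the $10,000 Mac Studio, and about 1T total parameters for Kimi K2. The helper names `weight_gb` and `fits` are made up for illustration.

```python
def weight_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in GB (1 GB = 10^9 bytes)."""
    # params_billion * 1e9 weights * (bits / 8) bytes each, divided by 1e9
    return params_billion * bits_per_weight / 8


def fits(params_billion: float, bits_per_weight: float,
         memory_gb: float, overhead: float = 1.2) -> bool:
    """True if weights plus an assumed flat overhead fit in memory_gb.

    The 1.2 overhead factor is a guess covering KV cache, activations
    and the runtime itself; real usage depends on context length.
    """
    return weight_gb(params_billion, bits_per_weight) * overhead <= memory_gb


# Assumed memory sizes (GB) and parameter counts (billions); not from the thread.
machines = {"one Spark": 128, "one Mac Studio": 512, "two Mac Studios": 1024}
models = {"gpt-oss-120b": 120, "MiniMax M2": 230, "Kimi K2": 1000}

if __name__ == "__main__":
    for model, params in models.items():
        for machine, mem in machines.items():
            verdict = "fits" if fits(params, 4, mem) else "too big"
            print(f"{model} @ 4-bit on {machine}: {verdict}")
```

Under those assumptions the estimate lines up with the hardware tiers above: 120B fits on one Spark, 230B does not but fits on a 512 GB Mac Studio, and a 1T-parameter model needs two of them. Change `bits_per_weight` or `overhead` and the boundaries move, so treat it as a sanity check and not a sizing guide.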

1 comment

Also, at some point the Blackwell-generation DGX Station is supposed to ship with 768 GB of unified memory. It will presumably come with a high five-figure price tag, and it should be able to run most open-source models with little need to trade off quality for speed.

Trouble is, there's not even much hype surrounding the launch yet, much less shipping hardware. Which seems kind of ominous.