by simonw
226 days ago
That really is the most interesting question for me: when will it be possible to run a model that is good enough to drive Claude Code or Codex CLI on consumer hardware?

gpt-oss-120b fits on a $4,000 NVIDIA Spark and can be used by Codex - it's OK, but still nowhere near the bigger models: https://til.simonwillison.net/llms/codex-spark-gpt-oss

But... MiniMax M2 benchmarks close to Sonnet 4 and is 230B parameters - too big for one Spark, but it can run on a $10,000 Mac Studio. And Kimi K2 runs on two Mac Studios ($20,000). So we are getting closer.
|
Trouble is, there's not even much hype surrounding the launch yet, much less shipping hardware. Which seems kind of ominous.