| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by cschneid 264 days ago

so apparently they have custom hardware that is basically absolutely gigantic chips - across the scale of a whole wafer at a time. Presumably they keep the entire model right on chip, in effectively L3 cache or whatever. So the memory bandwidth is absurdly fast, allowing very fast inference.

It's more expensive to get the same raw compute as a cluster of nvidia chips, but they don't have the same peak throughput.

As far as price as a coder, I am giving a month of the $50 plan a shot. I haven't figured out how to adapt my workflow yet to faster speeds (also learning and setting up opencode).

1 comments

bigyabai 264 days ago

For $50/month, it's a non-starter. I hope they can find a way to use all this excess bandwidth to put out a $10 equivalent to Claude Code instead of a 1000 tok/s party trick I can't use properly.

link

typpilol 264 days ago

I feel the same and it's also why I can't understand all these people using small local models.

Every local model I've used and even most open source are just not good

link

behnamoh 264 days ago

the only good-enough model I still use it gpt-oss-120b-mxfp4 (not 20b) and glm-4.6 at q8 (not q4).

quantization ruins models and some models aren't that smart to begin with.

link

csomar 264 days ago

GLM-4.6 is on par with Sonnet 4.5. Sometimes it is better, sometimes it is worse. Give it a shot. It's the only model that made me (almost) ditch Claude. The only problem is, Claude Code is still the best agentic program in town and search doesn't function without a proper subscription.

link

DeathArrow 263 days ago

Have you tried Claude Code Router with GLM 4.6?

https://github.com/musistudio/claude-code-router

link

mcpeepants 264 days ago

z.ai hosted GLM 4.6 works great with claude code, drops right in

link

esafak 263 days ago

Have you tried opencode?

link

wyre 263 days ago

Cerebras offers pay-per-token. What are you asking for? Claude Code starts at $100, or $15/mtok. Cerebras is already much cheaper, but you want it to be even cheaper at $10?

link

xadhominemx 263 days ago

$600 per year is a trivial cost for a professional tool

link

bigyabai 263 days ago

$600 per anything is Herman Miller territory, pal. I'm not paying that for a SaaS.

link