Hacker News new | ask | show | jobs
by ekidd 285 days ago
> I would also like to know who the people behind Z.ai are — I haven’t heard of them before.

To be clear, Z.ai are the people who built GLM 4.5, so they're talking up their own product.

But to be fair, GLM 4.5 and GLM 4.5 Air are genuinely good coding models. GLM 4.5 Air costs about 10% of what Claude Sonnet does (when hosted on DeepInfra, at least), and it can perform simple coding tasks quite quickly. I haven't tested GLM 4.5 Air, but it seems to be popular as well.

If you can easily afford all the Claude Code tokens you want, then you'll probably get better results from Sonnet. But if you already know enough programming to work around any issues that arise, the GLM models are quite usable.

But you can't easily run GLM 4.5 Air quickly without professional workstation- or server-grade hardware (RTX 6000 Pro 96GB would be nice), at least not without a serious speed hit.

Still, it's a very interesting sign for the future of open coding models.

1 comments

For agentic coding I found the price difference more modest due to prompt caching, which most GLM providers on Openrouter don't offer, but Anthropic does. Look at the cache read/write columns: https://openrouter.ai/z-ai/glm-4.5
Been playing with Grok Code Fast 1 in Cline via Open Router. It supports prompt caching as far as I can tell, and it certainly is cheap. It's been quite good for the stuff I've tried. YMMV.