Hacker News new | ask | show | jobs
by eckelhesten 4 hours ago
Sure, but whatever you do, don't buy their (Z.ai) lite plan.

I feel like i threw 15 dollars in the sea. I'm getting rate limited after 3-4 prompts. You get way less value than just paying 25 dollars for Claude or OpenAI models.

2 comments

Did you consider their peak hours and model usage multiplier? Read the green box https://docs.z.ai/devpack/overview#usage-instruction

I had the Lite plan, I NEVER maxed out the quota because I considered these things. If I, for example, switched over to GLM-5-Turbo, then I could've easily burned through quota.

How are you using it? I have the lite plan and I've only ever maxed my weekly usage a few hours before reset. I will concede that I'm not a super heavy LLM user but it's been really good for me.

My workflow is usually:

- read file. I want to achieve X, how do? Do not implement anything.

- I would do a, b and c

- sketch a brief implementation of your suggestion

- <code> (not writing files yet)

- instead of your approach x, wouldn't it make sense to instead do z? What would that look like?

- <code>

- nice, implement this

- starts writing files, run tests, etc.

Try pointing it to a small codebase, or even ask it to conjure information found online.

You'll see that it quickly gives up. Thing is, they seem to count cached hits as if they were the non-cached tokens.

I wont be subscribing again thats for sure. I am not paying iPhone money for a Xiaomi.

That's what I've been doing. I use crush normally. While the codebase are by no means huge, they're not tiny either.