| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by chis 209 days ago

Has anyone switched to Gemini CLI? It's so important but also exhausting keeping up with which model is the leading edge. Especially since every model has different idiosyncrasies you have to learn to work with it effectively.

Currently my ranking is

* Cursor composer: impressively fast and able but not tuned to be that agentic, so it's better for one-shot code changes than long-running tasks. Fantastic UI.

* Claude Code: Works great if you can set up a verifiable environment, a clear plan and set it loose to build something for an hour

* Grok: Similar to cursor composer but slower and more agentic. Not currently using.

* ChatGPT Codex, Gemini: Haven't tried yet.

10 comments

embedding-shape 209 days ago

I haven't tried Gemini CLI with Gemini 3 Pro, but pretty much all the others. I usually run four agents at the same time, for each task, giving them the same prompt and then comparing their responses.

Gemini CLI has the lowest rate limits, lowest inability to steer the models (not sure that's a model or tooling thing, but I cannot get any of the Google models to stop outputting code comments constantly and everywhere) and seemingly the API frequently becomes unavailable for some reason.

Claude Code is fast, easy to steer, but the quality really degrades really quickly and randomly, seemingly by time of day. I'm not sure if they're running differently quanitized models during different times, but there is a clear quality difference depending on when in the day I use it, strangely. Haven't found a way of verifying this though, ideas welcome.

Codex CLI is probably what I use the most, with "gpt-5+high", which is kind of slow, a lot slower than Claude Code, but it almost always gets it right on the first try, and seemingly no other model+tool does instruction following as good, even if your AGENTS.md is almost overflowing with rules and requirements, it seems to nail things anyways.