HN Mirror


	by causal 236 days ago
	This is my entire problem with Codex: it will spend ten minutes trying to one-shot a problem and usually goes off the rails at some point, whereas Claude seems much better at incrementally finding the right solution with me.

2 comments

stavros 236 days ago

I've heard this from many people, but I really haven't had this experience. Sonnet will write code that doesn't work, but Codex will give me working code basically every time. It does take longer, and it does think a lot, but I've never seen it go off the rails.

I do look at the backend code it writes, and it seems moderately sane. Sometimes it overcomplicates things, which makes me think that there are a few dragons in the frontend (I haven't looked), but by and large it's been ok.

causal 236 days ago

> (I haven't looked)

Oh.

stavros 236 days ago

> I do look at the backend code it writes, and it seems moderately sane

Not good enough for you?

causal 236 days ago

It's just a different way of approaching the problem, and might partially explain the preference for Codex's style.

wahnfrieden 236 days ago

If I'm doing a large task, I use GPT 5 Pro to write a spec first (with advice for Codex, a broken-down task list, snippets, etc.). I may also supply entire files or repos as context for 5 Pro to produce this.

If I skip 5 Pro but still have a large task, I have Codex write a spec file to use as a task list and to review for completeness as it works.

This is how you can use Codex without a plan mode.

stavros 236 days ago

I still wish it would do all that on its own, without me having to switch models and make sure it won't make code changes.

embedding-shape 236 days ago

Well, when you use GPT 5 Pro Mode it can't make any code changes, so not really a problem :)

I have a similar workflow to the parent: GPT 5 Pro for help with specifications and deep troubleshooting, and Codex to ground it in my actual code and project and to execute the changes.

wahnfrieden 236 days ago

Codex won't read as much of your code as 5 Pro will (if you give it the context), and Codex will skip over reading in context that you give it (5 Pro can decide what's relevant after reading it all).

Yes, Codex is still very early. We use it because it's the best model. The client experience will only get better from here. I noticed they onboarded a bunch of devs to the Codex project on GitHub around the time of 5's release.

embedding-shape 236 days ago

> and Codex will skip over reading in context that you give it

That hasn't been my experience at all, neither with the Codex UI when it first became available to Pro users, nor with the CLI since I started using it. GPT 5 Pro will (can, to be precise) only read what you give it, whereas Codex goes out searching for what it needs, almost always.

wahnfrieden 236 days ago

That’s what I’m saying. Codex will search but then won’t read full files and is stingy with ingesting context. 5 Pro will take in a lot more context (quality holds up to about 60k input tokens), but you must give it that context yourself. So sometimes you can even use Codex first to find which full files you should give to 5 Pro to create the spec/task list.

What my quote meant is that once you have the context Codex needs to do its work, if you give it that context, it’ll start the work right away without going and reading all those files again. That can help minimize context use within a Codex session: have 5 Pro or just another Codex read in a lot of context to identify what is relevant, instead of having Codex waste precious context headroom on discovery in a session that is dedicated to doing the work.

nl 236 days ago

I've noted it elsewhere, but Codex has a plan mode.

On the web, press the "+" button next to the repo.

wahnfrieden 235 days ago

I can’t use the web version because I do iOS dev.
