| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by siva7 479 days ago
	o3-mini and gpt-4o are so piss poor in agent coding compared to claude that you don't even need a benchmark

2 comments

jbellis 479 days ago

o3-mini-medium is slower than claude but comparable in quality. o3-mini-high is even slower, but better.

link

danielbln 479 days ago

Claude really is a step above the rest when it comes to agentic coding.

link

dr_kiszonka 479 days ago

When I used it with Open Hands it was great but also quite expensive (~$8/hr). In Trea, it was pretty bad, but free. Maybe it depends on how the agents use it? (I was writing the same piece of software, a simple web crawler for a hobby RAG project.)

link