| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by dreis_sw 205 days ago
	I agree with your sentiment, this incremental evolution is getting difficult to feel when working with code, especially with large enterprise codebases. I would say that for the vast majority of tasks there is a much bigger gap on tooling than on foundational model capability.

1 comments

qingcharles 205 days ago

Also came to say the same thing. When Gemini 3 came out several people asked me "Is it better than Opus 4.1?" but I could no longer answer it. It's too hard to evaluate consistently across a range of tasks.

link