| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by joshmlewis 318 days ago
	I am convinced. I've been giving it tasks the past couple hours that Opus 4.1 was failing on and it not only did them but cleaned up the mess Opus made. It's the real deal.

3 comments

diego_sandoval 318 days ago

On that same vein, I had just tried Opus 4.1 yesterday, and it succesfully completed tasks that Sonnet 4 and Opus 4 failed at.

link

joshmlewis 318 days ago

When it came out on Tuesday I wanted to throw my laptop out of the window. I don't know what happened but results were total garbage earlier this week. It got better the past couple days but so far with gpt-5 being able to solve problems without as much correction I'm going to use it more.

link

alfalfasprout 318 days ago

Interesting, I've had the complete opposite experience. Opus 4.1 feels like a generational improvement compared to GPT-5.

link

joshmlewis 318 days ago

It is funny how it can be like this sometimes. I think a lot depends on coding styles, languages, prompting, etc.

link

energy123 317 days ago

And it's almost 10x cheaper via flex, and in #1 position on lmarena. It's not even close.

link