Hacker News
by joshmlewis 318 days ago
I am convinced. I've been giving it tasks the past couple hours that Opus 4.1 was failing on and it not only did them but cleaned up the mess Opus made. It's the real deal.
3 comments

In that same vein, I just tried Opus 4.1 yesterday, and it successfully completed tasks that Sonnet 4 and Opus 4 failed at.
When it came out on Tuesday I wanted to throw my laptop out of the window. I don't know what happened, but results were total garbage earlier this week. It got better over the past couple of days, but so far, with GPT-5 being able to solve problems without as much correction, I'm going to use it more.
Interesting, I've had the complete opposite experience. Opus 4.1 feels like a generational improvement compared to GPT-5.
It is funny how it can be like this sometimes. I think a lot depends on coding styles, languages, prompting, etc.
And it's almost 10x cheaper via flex, and in the #1 position on lmarena. It's not even close.