|
This can't be understated. I started using it heavily earlier this summer and it felt like magic. Someone signing up now based on how I described my personal experiences with it then would think I was out of my mind. For technical tasks it has been a net negative for me for the last several weeks. (Speaking of both Claude Code and the desktop app, both Sonnet and Opus >=4, on the Max plan.) |
As an example I’ve been using an MCP tool to provide table schemas to Claude for months.
There was a point where it stopped recognizing the tool unless mentioned in early August. Maybe that’s related to their degraded quality issue.
This morning after pulling the correct schema info Sonnet started hallucinating columns (from Shopify’s API docs) and added them to my query.
That’s a use case I’ve been doing daily for months and in the last few weeks has gone from consistent low supervision to flaky and low quality.
I don’t know what’s going on, Sonnet has definitely felt worse, and the timeline matches their status page incident, but it’s definitely not resolved.
Opus 4.1 also feels flaky, it feels like it’s less consistent about recalling earlier prompt details than 4.0.
I personally am frustrated that there’s no refund or anything after a month of degraded performance, and they’ve had a lot of downtime.