Hacker News new | ask | show | jobs
by raincole 34 days ago
It has always been like this. We actually know that the model performance has been mostly steady[0], but you cannot beat the notion of "evil companies secretly serving us worse models." The meme value is too strong.

[0]: https://marginlab.ai/trackers/claude-code/

2 comments

Your data support actual strength shifts, not narrative manipulation:

Range of 48-73.5 (peak 53.1+% higher than trough) with a single day shift of ~30%.

You suggest people are usually influenced more by narrative than data, but provide a narrative-heavy, data-light comment, e.g. "always" "know" "mostly steady" (hazy terms for data) "cannot beat" "evil companies" "meme strong".

A followup defining "mostly" and "steady" more clearly, and your purpose in writing in a narrative-shaping style would be helpful.

Hmm, today's pass rate raised to 73% - interesting, are they AB-testing some new model? This is too high for Opus 4.7.