Release date seems like a terrible x axis with how much more compute they are using. Not to mention while
I like what METR is trying to measure, it is an uber specific metric. And frankly, me just complaining, they’re prompts I feel do most of the work for the AI. I’ve never gotten as detailed instructions as they give the AI for the task
Whilst true, if you had unlimited compute 5 years ago, we wouldn't be anywhere near Mythos level purely because the technology behind the models wasn't refined enough.
It is really hard to believe you actually believe this unless there really is this class of people that are so addicted to social media that they have confused performativity with actual thinking.
[1] https://metr.org/time-horizons/