by scotty79 132 days ago
Kinda sus that the least-known model did best and none of the more recent models were tested. Capabilities grow very fast: things that now routinely succeed rarely, if ever, succeeded even half a year ago.

I mean, performance is so bad across the board that this is likely essentially random. Monkeys accidentally doing a bit of Shakespeare.
That's wildly overestimating what monkeys can do on a typewriter.

It takes a lot just to be mediocre. Which, don't get me wrong, I'll agree current ML is; it's just that "mediocre" is an incomprehensibly huge step up from "random".
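For a rough sense of scale on the "random" end, here is a back-of-the-envelope sketch. Everything in it is an assumption for illustration and not from the thread: a 27-key typewriter (a-z plus space), keys hit uniformly and independently, and a hypothetical 18-character target phrase.

```python
import math

# Assumptions (not from the thread): a 27-key typewriter (a-z plus space),
# keys hit uniformly and independently, and a hypothetical target phrase.
ALPHABET_SIZE = 27
TARGET = "to be or not to be"


def log10_chance(phrase: str, keys: int = ALPHABET_SIZE) -> float:
    """log10 of the probability that one random attempt types `phrase` exactly."""
    return -len(phrase) * math.log10(keys)


def expected_attempts(phrase: str, keys: int = ALPHABET_SIZE) -> int:
    """Expected number of independent attempts before the first exact match."""
    return keys ** len(phrase)


if __name__ == "__main__":
    print(f"log10(p) = {log10_chance(TARGET):.1f}")
    print(f"expected attempts = {expected_attempts(TARGET):.3e}")
```

Under those assumptions, a single short line already needs somewhere between 10^25 and 10^26 attempts on average, so even consistently bad-but-coherent output is nowhere near what uniform random typing produces.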