by sigmoid10 321 days ago
As the other commenter already pointed out, I'll believe it when I see it on the leaderboard. But even then it has already lost twice against the winner of last year's competition, because that too was a general-purpose LLM that could also do other things.

Let's not move the goalposts here =) I don't think it's really fair to compare them directly like that. But I agree, this is triggering my "too good to be true" reflex very hard.
If anything, they moved the goalposts closer to the starting line. I'm merely putting them back where they belong.