> 10000 samples are nothing compared to 2^100 possible outputs. It is absolutely, definitely not a "brute search". Testing a small fraction of possibilities (e.g. 0.000001%) is called heuristics, and that's what people use too.

Brute searching literally means generating solutions until one works. Which is exactly what is being done here.

> Please learn a bit of combinatorics.

Don't be condescending - I understand the problem space just fine. Fine enough to realise that the problem was constructed specifically to ensure that "solutions" such as this just won't work. Which is why this "solution" is straight-up broken (doesn't meet the target, exceeds the computational bounds, etc.).

> It's ridiculous to demand GPT-4 performs as good as a human.

Wasn't the whole point of this prize to spur interest in a new approach to learning? What does GPT-[1234] have to do with the contest rules? Especially since this solution broke those rules anyway?

> Obviously its vision is much worse and it doesn't have 'video' and physics priors people have, so it has to guess more times.

That's precisely my point - it has to guess. Humans aren't guessing for those types of problems (not for the few that I saw anyway).

It's similar in that a lot of wrong answers are being thrown up, but I think this is more like a probabilistic system that is being pruned than a walk of the solution space. It's much smarter than brute force, but not as smart as we would like.
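
For a sense of scale, here is a rough sketch of the arithmetic behind the two figures quoted upthread (10000 samples, 2^100 possible outputs). The helper name `coverage` is made up for illustration, and the calculation assumes every sample is a distinct candidate. It only measures how much of the space gets touched; it says nothing about how the candidates are chosen, which is the part actually in dispute.

```python
from fractions import Fraction

SAMPLES = 10_000         # figure quoted in the thread
SEARCH_SPACE = 2 ** 100  # figure quoted in the thread


def coverage(samples: int, space: int) -> Fraction:
    """Exact fraction of the space touched by `samples` distinct candidates.

    Hypothetical helper for illustration only.
    """
    return Fraction(samples, space)


frac = coverage(SAMPLES, SEARCH_SPACE)

# Fraction keeps the value exact; convert to float only for display.
print(f"fraction of the space sampled: {float(frac):.3e}")  # roughly 7.9e-27
```

So both sides can be right about their own point: the sampled fraction is vanishingly small, and the procedure can still be "generate candidates until one passes".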