Hacker News new | ask | show | jobs
by FergusArgyll 196 days ago
It's very much a vision test. The reason all the models don't pass it easily is only because of the vision component. It doesn't have much to do with reasoning at all