Poetiq shatters ARC-AGI 2 benchmark at half the cost (poetiq.ai)
25 points by flavio87 195 days ago
1 comments

Given the release of GPT-5.2, this article is worth discussing alongside it, since Poetiq managed to match GPT-5.2's high score using Gemini 3 Pro.
Am I crazy to think these models have actually surpassed human performance on ARC 2? https://www.lesswrong.com/posts/DX3EmhmwZjTYp9PBf/ai-perform...
This is not surprising; rather, it's the 100% figure that makes me skeptical. The intelligence level of ordinary people isn't that high, and AI can indeed surpass it. Otherwise, why would we use it?