Hacker News
Open Source Models Score Low on ARC-AGI-2 Reasoning Benchmark (xcancel.com)
2 points by ironyman 103 days ago