by cubefox 256 days ago
The "catch" is that TRM is a very small model and a relatively narrow architecture, which shows that the ARC-AGI benchmark doesn't actually test for AGI. (Which the ARC guys kind of admitted themselves by releasing a "-2" version and working on a "-3".)