Hacker News
by ogogmad 550 days ago
> They can't go beyond the boundaries of their training set.
TFA says they just did. That's what the ARC-AGI benchmark was supposed to test.