by momojo 192 days ago
Anyone have any thoughts? ARC-AGI (and ARC-AGI-2) is pretty much the only benchmark that still interests me, because of its abstract nature.