Y
Hacker News
new
|
ask
|
show
|
jobs
by
deviation
129 days ago
I'm impressed with the Arc-AGI-2 results - though readers beware... They achieved this score at a cost of $13.62 per task.
For context, Opus 4.6's best score is 68.8% - but at a cost of $3.64 per task.