by freehorse
541 days ago
> I am interpreting this result as human level reasoning now costs (approximately) 41k/hr to 2.5M/hr with current compute. On a very simple, toy task, which ARC-AGI basically is. ARC-AGI tests are not hard per se, just LLMs find them hard.
We do not know how this scales for more complex, real-world tasks.

The other benchmarks are a good indication, though.