Hacker News
by asutekku 84 days ago
"a harness for a memory", so it still requires external tools to work well. The whole point of this benchmark is to validate that the systems can solve problems without any outside help.