Y
Hacker News
new
|
ask
|
show
|
jobs
by
nileshtrivedi
458 days ago
That remains to be seen. Manus, a standard agent built with Claude 3.7, outperforms o3 agentic model on the GAIA benchmark.