Hacker News
by nileshtrivedi 458 days ago
That remains to be seen. Manus, a standard agent built with Claude 3.7, outperforms the o3 agentic model on the GAIA benchmark.