Hacker News
Simular Agent S hits 72.6% success on 369 real computer tasks (human: 72.36%) (os-world.github.io)
2 points by taro666 187 days ago