Hacker News new | ask | show | jobs
by proc0 411 days ago
Interesting. I see from the video example it took a lot of steps and there is a lot of output for a simple task. I'm thinking this probably doesn't scale very well and more complex tasks might have performance challenges. I do think it's the right direction for AI coding.
1 comments

Yeah, I suppose to esafak's point, perhaps a benchmark for browser agent QA testing would be needed.