| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by proc0 411 days ago
	Interesting. I see from the video example it took a lot of steps and there is a lot of output for a simple task. I'm thinking this probably doesn't scale very well and more complex tasks might have performance challenges. I do think it's the right direction for AI coding.

1 comments

Yeah, I suppose to esafak's point, perhaps a benchmark for browser agent QA testing would be needed.