Hacker News new | ask | show | jobs
by muzani 486 days ago
It has been done:

https://youtube.com/watch?v=shnW3VerkiM

https://youtube.com/watch?v=VQhS6Uh4-sI

First one is more impressive looking. Second one more reliable.

I think the real hard part is nobody wants to maintain these, and nobody really wants to pay to use them either. It's a lot of work and not something people do for free. It's no surprise these emerged (and won) in hackathons.

All the major operating systems are dedicating their full efforts into this, so it doesn't make much sense to actually raise money and do it.

2 comments

I see so basically saying this problem is too hard for a small team but the big labs will figure it out?
I read it rather that this is an economic question not a technical question.

There's no point working on a 3rd party feature if the OS will have that feature built-in. Economically that greatly reduces the likelihood of a return.

Especially in a market where customers expect, and will continue to expect this for free.

Yup, economics. Dude in the first video got a car for 3 months of work but he has no interest in continuing work on it now that he won.

Dude in the second video is trying to turn it into a startup, because he didn't get rewarded. He built it before Claude & OpenAI did theirs. So individualw can work faster than extremely well paid teams. But apparently people haven't shown that much interest in the product, so his interest is waning in building it.

Sorry but these are more preconceived demos, not software products that will automated anyone out of a job (yet).

I think it's coming but people are underestimating how hard GUI understanding and action control is in general. For specific RPA, it'll be great and find initial uses there.

makes sense, could you elaborate on the complexity? would like to understand what makes it a hard problem, thx!