Hacker News
by Wowfunhappy 9 days ago
> And no the solution here is not computer vision with an LLM.

Also, even if you hypothetically wanted to use computer vision with an LLM… what API is that LLM going to use to take screenshots and click on stuff?