Hacker News new | ask | show | jobs
by pydry 384 days ago
This looks like a recipe for making test flakiness even worse.
1 comments

It's quite stable on simple applications, but the flakiness definitely increases with more complex apps. I think there is a lot to do to fix it, one idea is to trim the accessibility tree to a narrow region that is relevant, then use it in a prompt. For example, if you want to "sort by price", the LLM doesn't really need to know about anything beyond the filters region of the app.