| HN Mirror

This is what teams are doing today. But LLMs have a tendency to greedily write tests, which leads to hacky tricks to make the test succeed.

agent-qa is a harness where playwright works as an execution kernel and LLM works as a observer, planner and verifier.