Hacker News new | ask | show | jobs
by willowwd9 34 days ago
What's the need of this? I run codex in loop and it writes and runs the playwright tests without any intervention.
1 comments

This is what teams are doing today. But LLMs have a tendency to greedily write tests, which leads to hacky tricks to make the test succeed.

agent-qa is a harness where playwright works as an execution kernel and LLM works as a observer, planner and verifier.