I told Windsurf to install Playwright, identify the app's crucial workflows, and add tests for them. It needed some input from me along the way, but I got what I wanted without getting my hands dirty.
The power here is that the coding agent can test visually, like a human would. So if a button isn't visible, the browser agent uses vision to detect that it's missing.
It sorta tests 'just like a human would' to make sure the implemented flow works as expected.
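To make the "is the button actually there?" idea concrete, here's a rough sketch of the kind of check one of those workflow tests boils down to. It's not the code Windsurf generated. So that it runs without a browser, it's written against two small hypothetical interfaces (`PageLike`, `LocatorLike`) whose method names mirror Playwright's `page.goto`, `page.getByRole`, `locator.isVisible` and `locator.click`. The `/login` route, the "Sign in" label, and the `FakePage` stand-in are all made up for illustration.

```typescript
// Hypothetical minimal interfaces; the method names mirror Playwright's
// page.goto, page.getByRole, locator.isVisible and locator.click.
interface LocatorLike {
  isVisible(): Promise<boolean>;
  click(): Promise<void>;
}

interface PageLike {
  goto(url: string): Promise<unknown>;
  getByRole(role: string, options?: { name?: string }): LocatorLike;
}

interface FlowResult {
  ok: boolean;
  reason?: string;
}

// Walk one assumed workflow: open the login page, confirm the
// "Sign in" button is visible, then click it.
async function checkSignInFlow(page: PageLike, baseUrl: string): Promise<FlowResult> {
  await page.goto(`${baseUrl}/login`); // "/login" is an assumed route
  const button = page.getByRole("button", { name: "Sign in" }); // assumed label
  if (!(await button.isVisible())) {
    return { ok: false, reason: "Sign in button is not visible" };
  }
  await button.click();
  return { ok: true };
}

// Stand-in page (hypothetical) for trying the check without a browser.
class FakePage implements PageLike {
  clicks = 0;
  buttonVisible: boolean;

  constructor(buttonVisible: boolean) {
    this.buttonVisible = buttonVisible;
  }

  async goto(_url: string): Promise<unknown> {
    return null;
  }

  getByRole(_role: string, _options?: { name?: string }): LocatorLike {
    return {
      isVisible: async () => this.buttonVisible,
      click: async () => {
        this.clicks += 1;
      },
    };
  }
}
```

In a real Playwright spec you'd skip the fake and write this inside `test('...', async ({ page }) => { ... })` from `@playwright/test`, with `await expect(button).toBeVisible()` doing the visibility check. Worth knowing: that assertion decides visibility from the rendered page (layout and styles), not from a screenshot, so it's the scripted cousin of what the agent does by looking.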