|
|
|
Show HN: Resurf – realistic, reproducible test framework for AI browser agents
(github.com)
|
|
5 points
by andrew_zhong
45 days ago
|
|
Systematic testing of browser agents today is not easy: testing on real websites is flaky, rate-limited and potentially expensive (e.g. using proxies or bypassing Captcha), while static-HTML benchmarks lack state and dynamic behavior. Resurf gives your browser agent a realistic, stateful, instrumented framework — built on synthetic websites with failure-mode injection: - Realistic, dynamic, interactive environment
- Deterministic & reproducible
- Failure-mode injection (latency, payment errors, 5xx)
- Auditable success eval (DB state, not LLM judge)
- No dependency on live websites
- Browser Use and Stagehand supported out of the box |
|