Show HN: Resurf – realistic, reproducible test framework for AI browser agents
5 points
3 days ago
| 0 comments
| github.com
| HN
Systematic testing of browser agents today is not easy: testing on real websites is flaky, rate-limited and potentially expensive (e.g. using proxies or bypassing Captcha), while static-HTML benchmarks lack state and dynamic behavior.

Resurf gives your browser agent a realistic, stateful, instrumented framework — built on synthetic websites with failure-mode injection:

- Realistic, dynamic, interactive environment - Deterministic & reproducible - Failure-mode injection (latency, payment errors, 5xx) - Auditable success eval (DB state, not LLM judge) - No dependency on live websites - Browser Use and Stagehand supported out of the box

No one has commented on this post.