|
|
|
|
|
by theredsix
102 days ago
|
|
Hi HN, op here! This is an open source browser protocol for LLM agents. The browser shows the model the current page, the model chooses the next action, and the browser returns the new state. Between steps, JavaScript and time are frozen so the page stays still while the model thinks. That makes things like ecommerce shopping and popup-heavy web app workflows much more reliable. Using this setup, the project gets ~90% on Online Mind2Web. My bet is that browser agents need a protocol designed for models, not just wrappers around CDP. |
|