Hacker News new | ask | show | jobs
by MagMueller 471 days ago
We extract all the interactive elements from a page like id 1. button id 2. drop-down id 3. textarea.

Then we present this list to the LLM with the task and the LLM outputs input_text(id 3, Hello World).

Finally, we execute the Playwright code to perform the actual action of inputting text into this element.