The voice-ai-device startup Rabbit seems to have a lot of browser automation stuff in their research side, they're calling their stuff a Large Action Model: https://www.rabbit.tech/research
But you cannot try/download etc it right? We need open source stufff for things that control computers via a layer of vague human language. In my opinion of course.