Hacker News
by
shadowgovt
823 days ago
This has the potential to be a step towards the missing scripting language for graphical interfaces, which is great.
DanyWin
823 days ago
Thanks! Funny thing: we did not use vision models, only text, working from the HTML of the current page. However, we intend to add vision to boost performance.
jerpint
822 days ago
Interesting that it’s not vision-based. I suspect you will get much better performance once vision is incorporated, e.g. using LLaVA-style models.