Hacker News new | ask | show | jobs
by DanyWin 822 days ago
Thanks! Funny thing, we did not use Vision models but text only with the HTML of the current page. However, we intend to add it to boost performance
1 comments

Interesting that it’s not vision based, I suspect you will get much better performance once vision is incorporated, using e.g LLaVa style models