Hacker News new | ask | show | jobs
by famouswaffles 522 days ago
>visual llm works on textual descriptions

SOTA V-LLMs do not work on textual descriptions.