by K0balt 339 days ago
Can multimodal LLMs read the PDF file format to extract text components as well as graphical ones? That would seem to me to be the best way to go.