Hacker News
by jejeyyy77 505 days ago
no, websites/pdfs were designed and laid out visually by humans for humans.

if you are just parsing the text, you’ve lost a ton of information encoded in the layout/formatting.

and that’s before even considering actual visual assets like graphs, images, etc.