Hacker News new | ask | show | jobs
by db48x 1562 days ago
In a PDF, every single glyph could be independently positioned. Most aren’t quite that pathological, but frequently every text run is independent.
1 comments

A PDF can contain metadata that correctly sequences the characters which is great for cut-n-paste, accessibility and reflowing the text. That metadata is frequently absent and the mechanism for incorporating it is by no means simple.