by ijafri 2885 days ago
I happen to work in an industry (international shipping) where we scan billions of PDFs daily, and I have no clue how we would 'sanely' share docs with all the parties involved if it were not for PDF. TIFF isn't good either. Apart from physical scans of paper, our PDF docs are around 3-10 KB, and 'generated PDF docs' are 5 KB at best.
1 comments

It's scary how differently PDFs can be output depending on the exporter. I have some Word docs for user and installation guides that can be 100% larger depending on whether the content writer saves it as PDF, exports it as PDF, or prints the document to a PDF.
Not really. Each of those PDFs has a different intent and thus a different amount of metadata embedded in it.
For the purposes of what we're doing with them, we don't care about any of that, however. The real answer would be to automate the process so that people don't have to think about it and have no opportunity to do it wrong; it just happens. But it hasn't been a high enough priority to bother getting right.
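
For what it's worth, a rough sketch of what "automate the process" could look like: push every guide through one fixed export path so the output no longer depends on which menu item the writer picked. This assumes LibreOffice is installed and reachable as `soffice`, and that the guides are `.docx` files; nothing in the thread says what tooling is actually in use. The function names and the `runner` parameter are made up for illustration.

```python
import subprocess
from pathlib import Path


def build_convert_command(source, out_dir, binary="soffice"):
    """Return the LibreOffice command line that exports one document to PDF."""
    return [
        binary,
        "--headless",            # run without opening a GUI window
        "--convert-to", "pdf",   # target format
        "--outdir", str(out_dir),
        str(source),
    ]


def convert_all(source_dir, out_dir, binary="soffice", runner=subprocess.run):
    """Export every .docx in source_dir to PDF through the same command.

    Returns the PDF paths the converter is expected to write. `runner` is a
    hypothetical hook so the command can be swapped out or dry-run.
    """
    source_dir, out_dir = Path(source_dir), Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    expected = []
    for doc in sorted(source_dir.glob("*.docx")):
        # check=True raises if the converter exits with a non-zero status
        runner(build_convert_command(doc, out_dir, binary), check=True)
        expected.append(out_dir / (doc.stem + ".pdf"))
    return expected
```

Calling something like `convert_all("guides", "pdf_out")` from a scheduled job or a commit hook would take the save/export/print choice away from the writer. Whether the resulting files come out smaller than any of the Word export routes is something you'd have to measure.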