|
> the PDF is not a format fit for sharing, discussing, and reading on the web. PDFs are (mostly) static, 2-dimensional and non-actionable objects. It is not a stretch to say that a PDF is merely a digital photograph of a piece of paper. It is too far a stretch, murdering the poor subject: PDFs are the best format available for long-term information, such as research papers. They have the advantages of digital data: Searchable, copy-able, transmittable, and data is extractable. They are also an open format, don't rely on a central service to be available, and they preserve presentation across platforms. They have metadata, and are annotatable and reviewable. And the PDF format is the best for long-term preservation, carefully designed to be readable in 50 years - partly because they preserve presentation across platforms - and that includes the metadata, annotations, and reviews. PDFs are like paper in that they will look the same 50 years from now as they do today, unlike (almost?) any other digital format. Yes, I wish they were a bit more dynamic in layout, and that the text was more cleanly extracted. |
That's true for plain text (in the best case), but try extracting an equation, table or a diagram.
Stepping away from best case, PDFs in theory look the same everywhere, but turn into a mess on buggy implementations or differing rendering engines – due to the insistence on having a stable presentation, they assume positining and sizing always works, so when that fails, it fails worse than a buggy rendering of a presentation-agnostic document like an HTML page.
(In my experience, bugs either enter just before printing, or when displaying using JS-based renderers).