by codeviking 1736 days ago
That's the idea!

If all goes well, we won't need this software anymore. In the best-case scenario, publishers start accepting HTML, and the days of having to convert PDFs to something better are gone!

How do you define pages in HTML?
We don't. We extract the content and present it as a single document.

Page anchors can be used for navigating between sections. We present a table of contents that makes this easy. For instance:

https://papertohtml.org/paper?id=6f9fc51102cf49bff4f4e2b3367...
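To make the anchor idea concrete, here is a minimal sketch of how a single-document view with a linked table of contents could be generated. This is not the site's actual code: the names `slugify` and `build_document`, the `sec-N-...` id scheme, and the markup are all assumptions made for illustration.

```python
import re
from html import escape


def slugify(title: str) -> str:
    """Turn a section title into a fragment-friendly string (hypothetical helper)."""
    slug = re.sub(r"[^a-z0-9]+", "-", title.lower()).strip("-")
    return slug or "section"


def build_document(sections: list[tuple[str, str]]) -> str:
    """Render (title, body) pairs as one HTML fragment with a linked table of contents.

    Assumption: each section gets an id of the form "sec-<n>-<slug>"; the index
    keeps ids unique even when two sections share a title.
    """
    toc_items = []
    body_parts = []
    for index, (title, body) in enumerate(sections, start=1):
        anchor = f"sec-{index}-{slugify(title)}"
        # The table of contents links to the fragment id of each section.
        toc_items.append(f'<li><a href="#{anchor}">{escape(title)}</a></li>')
        body_parts.append(
            f'<section id="{anchor}"><h2>{escape(title)}</h2>'
            f"<p>{escape(body)}</p></section>"
        )
    toc = "<nav><ol>" + "".join(toc_items) + "</ol></nav>"
    return toc + "".join(body_parts)


if __name__ == "__main__":
    print(build_document([("Introduction", "Hello"), ("Related Work", "A & B")]))
```

Since everything lives in one document, there are no pages to define: a link to `#sec-2-related-work` just scrolls to that section.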

Great initiative, HTML is the way to go!

It would be great if you could add some basic CSS rules for print. Right now, navigation elements are needlessly repeated on each printed page, obscuring the content.
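For what it's worth, a rough sketch of what such print rules might look like, wrapped in a small Python helper that injects them into a generated page. The selectors `nav` and `.toc`, the constant `PRINT_CSS`, and the function `add_print_styles` are guesses for illustration; the real page's markup may well differ.

```python
# Assumed print rules: the selectors are hypothetical and would need to match
# whatever elements the real page uses for its navigation.
PRINT_CSS = """
@media print {
  /* Hide navigation so it is not repeated on every printed page. */
  nav, .toc { display: none; }
  /* Try to keep a heading together with the text that follows it. */
  h1, h2, h3 { break-after: avoid; }
}
"""


def add_print_styles(html: str, css: str = PRINT_CSS) -> str:
    """Insert a <style> element with print rules just before </head>.

    If the input has no </head>, the style element is simply prepended.
    """
    style = f"<style>{css}</style>"
    if "</head>" in html:
        return html.replace("</head>", style + "</head>", 1)
    return style + html


if __name__ == "__main__":
    page = "<html><head><title>t</title></head><body><nav>menu</nav></body></html>"
    print(add_print_styles(page))
```

Rules inside `@media print` only apply when printing, so the on-screen layout stays untouched.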

Also, you forgot to include the bold and italic webfonts, so the browser renders faux styles for all headings and emphasis.