Hacker News new | ask | show | jobs
by bjnord 4920 days ago
I, for one, would be interested in seeing that nokogiri script that turns ugly HTML into markdown... although I'd totally understand if you want to keep it close as a competitive advantage. ;-) I regularly have to deal with the ugly HTML that various apps' "Export" features disgorge.
1 comments

It's so hacky that it's almost certainly a competitive disadvantage.

Converting to markdown is a bit of an overstatement - it only handles the limited content that I encountered in this book.

- H1 heading -> ##

- Strip P tags and add newlines

- Replace 'See SHEET \d+' with [[Article:\d]]