Hacker News new | ask | show | jobs
by subbu_devhub 154 days ago
@focusedone I tried reader mode in several browsers it was such a hit or miss, it just did not work for me, and honestly I wanted to convert to markdown not just plain text

I tried several reader modes, there were several issues including * several potions of the main content was missing * the navigation bits get caught when in reader mode * the comments and other un-related sections come in play

I really tried these before invesitng time in this

1 comments

Oh, cool! How does this do with the intentionally obfuscated sites?
@focusedone if you see the code link here https://github.com/subranag/declutter/blob/main/src/page.ts there are some specific techniques recommended to simulate normal browsing behaviour, but it does not work 100% percent of the times, but works on most of the sites

for example * simulate scrolling after page loads * simulate plugins * simulate location etc

once all of this is done hopefully the HTML content becomes readable