Hacker News new | ask | show | jobs
by marcamillion 5863 days ago
This looks interesting, I would definitely use the bookmarklet, because it sucks to have a page always open.

I would have hoped that the original page (including images and stylesheet) were saved locally. Otherwise, this is like an RSS reader, and I am not a big fan of RSS readers for that very same reason.

Just my $0.02.

P.S. Prettifying the landing page would go a long way :)

2 comments

All the html, images, css, and js files are saved locally using HTML5's cache.manifest so everything should load while offline. It is for me at least. What browser version are you using?

Yes, making it pretty is on the todo list.

Thanks for the feedback.

Double checked, looks like I wasn't serving the cache.manifest file correctly. Should be fixed now. Try again. Nice catch.
Hrmm....it kind of works. The issue I am having now, is that on the page I tried it only does part of the page.

http://52weeksofux.com/tagged/week_1

I tried to capture that page, and it only captured the bottom story - not the top.

I never checked the source or anything, so it could very well be something about that specific site.

That was just the first site I tried, and that's what I found.

Hope that helps :)

Ah, that is the result of the Readability, http://lab.arc90.com/experiments/readability/ algorithm, it is a bit greedy and chopped off some of the content.

If there are better algorithms/tools for scraping content please let me know.