Hacker News new | ask | show | jobs
by tlianza 4531 days ago
If you're interested in hosted solutions that try to do automatic identification of pages, diffbot is worth a look. We've had some good experiences: http://diffbot.com/