Hacker News
Ask HN: Where do companies like Outpan and Semantics3 get their data from?
4 points by joshpen188 3228 days ago
1 comment

Outpan here,

We use a combination of polite, massive-scale web crawling and user contributions. We ran an in-house web crawler for years and recently released it as a stand-alone service.
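
For readers wondering what "polite" crawling usually means in practice, here is a minimal sketch. It is not Outpan's crawler: the `PolitenessGate` class, the `example-crawler` user agent, and the one-second default delay are all assumptions made for illustration. It shows the two common ingredients, honoring robots.txt rules and spacing out requests to each host, using Python's standard `urllib.robotparser`.

```python
import time
from urllib.parse import urlparse
from urllib.robotparser import RobotFileParser


class PolitenessGate:
    """Hypothetical helper: decides whether and when a URL may be fetched."""

    def __init__(self, user_agent, default_delay=1.0, clock=time.monotonic):
        self.user_agent = user_agent
        self.default_delay = default_delay  # assumed fallback, in seconds
        self.clock = clock                  # injectable so it can be tested
        self.rules = {}         # host -> parsed robots.txt
        self.next_allowed = {}  # host -> earliest time of the next request

    def load_robots(self, host, robots_txt):
        """Parse robots.txt text that the caller has already downloaded."""
        parser = RobotFileParser()
        parser.parse(robots_txt.splitlines())
        self.rules[host] = parser

    def allowed(self, url):
        """True only if robots.txt for the host is loaded and permits the URL."""
        parser = self.rules.get(urlparse(url).netloc)
        # No rules loaded yet: be conservative and refuse.
        return parser is not None and parser.can_fetch(self.user_agent, url)

    def wait_time(self, url):
        """Seconds to wait before the host may be contacted again."""
        host = urlparse(url).netloc
        return max(0.0, self.next_allowed.get(host, 0.0) - self.clock())

    def record_fetch(self, url):
        """Note that a request was just made, honoring Crawl-delay if given."""
        host = urlparse(url).netloc
        parser = self.rules.get(host)
        delay = parser.crawl_delay(self.user_agent) if parser else None
        self.next_allowed[host] = self.clock() + (delay or self.default_delay)
```

A real crawler would call `allowed()` before queuing a URL, sleep for `wait_time()` before each request, and call `record_fetch()` afterwards. At massive scale the per-host state would typically live in a shared store instead of an in-memory dict.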