Hacker News new | ask | show | jobs
by leptons 2059 days ago
This article is woefully incomplete and only covers a very specific limited use case for web scraping.

It doesn't mention puppeteer or why you may need to use something like that. It doesn't mention cookies or sessions or anything like that. And it doesn't mention using proxies or any web scraping countermeasures. It's very easy to make crawling difficult, and only very basic sites are easy to crawl with the methods described in the article.

1 comments

I was thinking this too. This article really shouldn't be upvoted. I can't really give away any of the secret sauces though :)