|
|
|
|
|
by pen2l
3156 days ago
|
|
It's been quite a while since I last did web-scraping (I used to use BeautifulSoup, more than a decade ago). I'm just wondering, since a lot of people are using fairly advanced cloud-hosting solutions with, I assume, tools offered by their respective hosting place to fight spam, is web-scraping a lot different from what it used to be about a decade ago? What steps do you guys take to prevent being identified as a bad actor by the place that you are scraping? And on the other end, if you have a data-rich website, what are your feelings toward aggressive scrapers? |
|