|
|
|
|
|
by Cyndre
5139 days ago
|
|
Here is my approach on how to find scrappers. They are already supplying fake data to see if they are being scrapped. Using this fake data they can find all the sites that are using their scrapped data. Congrats we now know who is scrapping you with a simple google search. Now comes the fun part. Instead of supplying the same fake data to all, we need to supply unique fake data to every ip address that comes to the site. Keep track of what ip, and what data you gave them. Build your own scrapper's specifically for the sites that are stealing your content and scrape them looking for your unique fake data. Once you find the unique fake data, tie it back to the ip address we stored earlier and you have your scrapper. This can be all automated at this point to auto ban the crawler that keeps stealing your data. But that wouldn't be fun and would be very obvious. Instead what we will do is randomize the data in some way so its completely useless etc. Sit back and enjoy |
|