|
|
|
|
|
by palakchokshi
4470 days ago
|
|
I know we can crawl them. There are no technical issues with crawling them. The issue is I want to respect their TOS. There are multiple ways to circumvent their anti-crawling code but that means any new search engine will have its roots in "shady" tactics of crawling.
IMHO crawling should be allowed if the purpose of the crawler is to show results that drive traffic back to the sites and not mashup the content and deliver it as "original content" on the site that crawled these domains. However e.g. Yelp's TOS do not allow for these types of crawlers that essentially drive traffic back to Yelp. |
|
Permitting certain search engines to crawl but not others is anticompetitive and violates the principle of an open web.
> any new search engine will have its roots in "shady" tactics of crawling.
Who cares? If you are successful enough, then you'll negotiate with them later. Don't worry about it now.
Crawl away!