| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by kevincox 1255 days ago
	When I mean crawler I mean something that discovers new pages. Refreshing the same URL isn't really crawling. But yes, it may be the best available solution in this case, even if I would argue that it isn't really it's main purpose.

2 comments

cmatthias 1255 days ago

After reading this and your response to a sibling comment I wholeheartedly disagree with you on both the specific definition of the word crawler and what the "main purpose" of robots.txt is, but glad we can agree that Google should be doing more to respect rate limits :)

link

ddevault 1255 days ago

What you're thinking about, in my opinion, is best referred to as a spider.

link