| HN Mirror



	by tedivm 693 days ago
	Scraping often uses the same APIs that the website itself does, so to block it a lot of sites would have to put their content behind authentication of some sort. For example, I have a project that crawls the SCP Wiki (following best practices, rate limiting, etc.). If they were to restrict the API that I use, it would break the website for regular visitors, so if they want to limit access they have no choice but to put it behind credentials they can trace back to a user, which means eliminating the public site itself. For a lot of sites that's just not reasonable.
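To make the "rate limiting" part concrete, here is a rough sketch of what a polite fetch loop can look like. This is not the code from my crawler: `RateLimiter`, `fetch`, the two-second interval, and the User-Agent string are all made up for illustration. It only uses the Python standard library.

```python
import time
import urllib.request


class RateLimiter:
    """Enforce a minimum interval between consecutive requests (hypothetical helper)."""

    def __init__(self, min_interval, clock=time.monotonic, sleep=time.sleep):
        self.min_interval = min_interval
        self._clock = clock    # injectable so the limiter can be tested without waiting
        self._sleep = sleep
        self._last = None      # time of the previous request, if any

    def wait(self):
        """Block until at least min_interval seconds have passed since the last call."""
        now = self._clock()
        if self._last is not None:
            remaining = self.min_interval - (now - self._last)
            if remaining > 0:
                self._sleep(remaining)
                now = self._clock()
        self._last = now


def fetch(url, limiter, user_agent="example-crawler/0.1 (contact: you@example.com)"):
    """Fetch one URL after waiting for the limiter; the User-Agent is a placeholder."""
    limiter.wait()
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    with urllib.request.urlopen(request, timeout=30) as response:
        return response.read()
```

Something like `limiter = RateLimiter(2.0)` shared across every `fetch` call keeps the crawler to at most one request every two seconds. An identifying User-Agent with contact details lets the site operator reach you instead of just blocking you.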