|
|
|
|
|
by simonw
1753 days ago
|
|
That used to be true, but today, with so many websites operating as SPAs against undocumented APIs, I think it's reasonable to redefine "scraping" to mean extracting data from unofficial APIs in addition to extracting it by parsing HTML. After all, what is a scrapeable HTML page if not a grotesquely convoluted undocumented API with an unstable output format? |
|
The gross inefficiency and low data-to-layout ratio are the key things being expressed through connotations of the word "scrape". To scrape is to extract a small amount of something from a much larger substrate.
To call every query a scrape is to diminish the specificity and utility of the term.