|
|
|
|
|
by ricardo81
2059 days ago
|
|
Done a fair bit of scraping in my time, mostly with PHP/curl and PHP's DOMDocument if necessary. I'd say to anyone learning how to code it's a good exercise in learning. Think a scraper for most sites can be built in an hour or two, depending on navigation and how data is sent to the client. Definitely noticed a trend towards XHR and JSON responses typically using a numeric ID. Probably the easiest type of site to scrape where you don't need to crawl navigation, simply iterate over a number range and the scraped data is already pretty much structured. |
|