by denysonique
4600 days ago
Node.js is usually the obvious choice for web scraping. Scraping with jQuery syntax such as:

  $('table tr').each(function(ix, el) {
    names.push($(el).find('td').eq(0));
    surnames.push($(el).find('td').eq(1));
  })

is more familiar to most web developers than the PHP syntax. Even if Node were 5x slower than PHP, I would still go for Node because of its easy jQuery syntax.
|
* cheerio (https://github.com/MatthewMueller/cheerio)
* PhpQuery (https://code.google.com/p/phpquery/wiki/jQueryPortingState)
Both of these use a jQuery-esque syntax, so your comment regarding DOM traversal in PHP is a moot point.