|
|
|
|
|
by premasagar
4675 days ago
|
|
I didn't submit this, but I did oversee the project. Noodle is a Node-based web scraper that also handles JSON, XML and other file formats. It was initially built as a hack project to replace a core subset of YQL. All responses can be served as JSONP, to allow for cross-domain scraping from a website's front-end. Selector queries can be used to grab a subset of a document - e.g. CSS selectors for HTML documents and dot-notation for JSON documents. It lets you request multiple documents in a single HTTP request, and a few other things. I helped to guide an intern, Aaron Acerboni, at my company, Dharmafly, when he built it last year. |
|