by premasagar 4675 days ago
I didn't submit this, but I did oversee the project.

Noodle is a Node-based web scraper that also handles JSON, XML and other file formats. It was initially built as a hack project to replace a core subset of YQL.

All responses can be served as JSONP, to allow for cross-domain scraping from a website's front-end.
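For anyone unfamiliar with JSONP, here is a rough sketch of the idea: the server wraps the JSON payload in a call to a function named by the client, so the response can be loaded through a script tag from another domain. This is not Noodle's actual code; the function name `to_jsonp` and the callback validation rule are assumptions made for illustration.

```python
import json
import re

# Hypothetical rule: only accept plain JavaScript identifiers as callback
# names, so arbitrary script cannot be injected through the callback parameter.
_CALLBACK_RE = re.compile(r"[A-Za-z_$][A-Za-z0-9_$]*")


def to_jsonp(payload, callback):
    """Wrap a JSON-serialisable payload in a call to the named callback."""
    if not _CALLBACK_RE.fullmatch(callback):
        raise ValueError(f"unsafe callback name: {callback!r}")
    return f"{callback}({json.dumps(payload)});"


print(to_jsonp({"title": "Example"}, "handleResult"))
```

The front-end page defines `handleResult` and adds a script tag pointing at the scraper URL; the browser then runs the response as a function call with the data as its argument.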

Selector queries can be used to grab a subset of a document, e.g. CSS selectors for HTML documents and dot notation for JSON documents. Noodle also lets you request multiple documents in a single HTTP request, among a few other features.
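To illustrate what a dot-notation selector over JSON means, here is a minimal sketch. It is not Noodle's implementation or query syntax; the function name `select` and the convention that numeric path segments index into arrays are assumptions.

```python
def select(document, path):
    """Walk a parsed JSON document along a dot-separated path.

    Assumption: a segment applied to a list is treated as an integer index.
    """
    current = document
    for key in path.split("."):
        if isinstance(current, list):
            current = current[int(key)]
        else:
            current = current[key]
    return current


doc = {"feed": {"entries": [{"title": "First"}, {"title": "Second"}]}}
print(select(doc, "feed.entries.1.title"))
```

A missing key or out-of-range index raises `KeyError` or `IndexError` here; a real service would more likely return an error object in the response instead.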

I helped to guide an intern, Aaron Acerboni, at my company, Dharmafly, when he built it last year.