Hacker News new | ask | show | jobs
by fizx 3710 days ago
https://github.com/fizx/parsley/wiki looks pretty similar.

Running this sort of thing as a service/api never panned out for us because you are almost universally robots.txt denied and/or blocked.

We briefly tried, and supported a wiki of json extraction scripts at parselets.org, but it went nowhere after a few months.