|
|
|
Ask HN: Is there any HTML table scraper generator in python or else?
|
|
2 points
by jeffjia
4622 days ago
|
|
Hi, In one of my projects, I happen to need to get some scrapers running for tens of websites to collect rows, columns of tables (<table>, <ul>, <div>). Those tables are well formatted. I have written several scrapers in python, which basically use CSS selector and then do some simple transformation with regular expression. I just wonder whether there is any scraper generator which may take a url and sample target output as input, and produce a scraper automatically? Any suggestion is welcomed. Thanks in advance. |
|
The webintro example here (https://github.com/ariya/phantomjs/wiki/Examples) scrapes a specific element.