Hacker News
by changmin 3347 days ago
Thank you, all.

Listly.io is a personal project I built a few days ago. I'd like to hear whether it is useful to you... or not.

Listly.io turns HTML into Excel in seconds, without coding. It finds patterns of repeated structure and extracts all of the image links and text. It does not look for specific tags (table, ul ...), but for the structure.
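To make the idea of matching structure rather than tags more concrete, here is a minimal Python sketch. It is not Listly's actual algorithm, which is not described in this thread; it only assumes that "repeated structure" means sibling elements whose subtrees have the same shape. All names in it (Node, TreeBuilder, signature, extract_rows) are hypothetical and exist only for illustration.

```python
from collections import defaultdict
from html.parser import HTMLParser

# Elements that never have a closing tag, so the parser must not descend into them.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class Node:
    """A bare-bones element node (hypothetical helper, not a Listly type)."""

    def __init__(self, tag, attrs=None, parent=None):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.parent = parent
        self.children = []
        self.text = []


class TreeBuilder(HTMLParser):
    """Builds a simple element tree from partial or full HTML."""

    def __init__(self):
        super().__init__()
        self.root = Node("root")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs, self.current)
        self.current.children.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_endtag(self, tag):
        # Close the nearest open element with this tag; ignore stray end tags.
        node = self.current
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent

    def handle_data(self, data):
        if data.strip():
            self.current.text.append(data.strip())


def signature(node):
    # Shape of a subtree: tag names only, ignoring text and attributes.
    return (node.tag, tuple(signature(child) for child in node.children))


def walk(node):
    yield node
    for child in node.children:
        yield from walk(child)


def collect(node):
    # Gather text and image links from a subtree, in pre-order.
    texts, images = [], []
    for n in walk(node):
        texts.extend(n.text)
        if n.tag == "img" and n.attrs.get("src"):
            images.append(n.attrs["src"])
    return {"text": texts, "images": images}


def extract_rows(html):
    """Return one row per element in the largest group of same-shaped siblings."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    best = []
    for parent in walk(builder.root):
        groups = defaultdict(list)
        for child in parent.children:
            groups[signature(child)].append(child)
        for group in groups.values():
            if len(group) >= 2 and len(group) > len(best):
                best = group
    return [collect(node) for node in best]


if __name__ == "__main__":
    sample = (
        '<div><img src="a.png"><span>Alpha</span></div>'
        '<div><img src="b.png"><span>Beta</span></div>'
    )
    for row in extract_rows(sample):
        print(row)
```

Because the grouping key ignores tag names at the top level only in the sense that any tag can repeat, the same code handles div, li, or tr rows alike. Picking the largest group by count is a deliberately naive choice; a real extractor would likely also weigh how much content each repeated unit holds. Input with no repeated siblings yields an empty list rather than an error.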

For developers, I think an API would be the best way to plug this extractor into other scrapers or your own.

1 comment

This site is actually really awesome, and has worked for every website I've tried! My only slight issue, however, is that it took me a few minutes to work out what "HTML codes" were, and even then only from watching the video. Have you considered renaming it to something like "HTML Source Code"? It also seems to struggle on web pages where it can't find tables, such as the following page I made, which contains no information:

https://hastebin.com/eguluvoquq.html

I appreciate your thorough testing and feedback. Your suggestion is a very good one.

Actually, any HTML source code, partial or full, is accepted: <div></div>, <p></p>, <span></span>, <html></html>, etc. Following your advice, I changed the placeholder description to "any HTML Source Code".

Secondly, my server returns a 500 error only when there is nothing to extract, as with your code. I will fix it soon. Thank you.