Hacker News new | ask | show | jobs
by RossM 2901 days ago
Their ML approach is most likely based on their https://github.com/scrapy/scrapely project, which uses instance-based learning (as I understand it, lightweight ML) to scrape other pages from a few examples.