Hacker News new | ask | show | jobs
by kanary 1025 days ago
Yael does incredible work on this repo & we have contributed in the past (we'd like to more in the future too!).

Doing this work is tough/timeconsuming but can be done DIY. We're working on automating it at scale. YC gave us a grant in 2018 for this work.

Automating this sounds simple but gets extremely complex to scale. - evading cloudflare - solving captchas - managing site footprint (think about google's algo to crawl the web) - ux / product design (how should we communicate to members if a site is an a$$?) - identity matching...

At the end of the day, we need better regulations in the US to help us wrangle the data broker industry. Hand in hand with more advanced technology for monitoring and holding this industry accountable... that seems the best path fwd.