Hacker News new | ask | show | jobs
by techn00 580 days ago
It doesn't, you cant find any reference to the captcha solving or anything like that in the repo. There's one line for the stealth plugin that is commented out:

// chromium.use(stealthPlugin());

imo this is the hardest part about scraping, evading bot detection and captchas.

edit: and keeping the scraping logic & rules up to date

1 comments

Hi, creator here.

So our open-source version does not provide cloudflare bypass or captcha support. It is impossible to have a robust system for the same completely FOSS. But we do have it available in our cloud version (which we launch soon, currently in testing). Our open-source version allows you to BYOP (Bring Your Own Proxy) to handle all the bypassing. Our OSS version is being used by users with small to medium scraping needs :)

All of this is mentioned in our README.md.