by robotswantdata 82 days ago
How does that work if the scraper takes a screenshot and feeds it to an LLM or OCR?
2 comments

That seems like a very expensive way to crawl the internet
Scrape normally and collect emails; if no email is seen, take a screenshot and OCR it. OCR is cheap and regex is cheap.
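A rough sketch of that two-stage flow, assuming the screenshot-and-OCR step is supplied by the caller. `ocr_page` is a hypothetical callable standing in for it, not a real library call, and the regex is deliberately simplistic:

```python
import re
from typing import Callable, List

# Deliberately simple pattern; real address syntax is broader than this.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")


def find_emails(text: str) -> List[str]:
    """Return unique email-like strings in order of first appearance."""
    return list(dict.fromkeys(EMAIL_RE.findall(text)))


def collect_emails(html: str, ocr_page: Callable[[], str]) -> List[str]:
    """Run the cheap regex pass on the markup first.

    ocr_page is a hypothetical callable that takes a screenshot and returns
    the OCR'd text; it is only invoked when the markup yields nothing.
    """
    found = find_emails(html)
    if found:
        return found
    return find_emails(ocr_page())
```

The expensive step only runs on pages where the plain scrape comes up empty, which is what keeps the overall crawl cheap.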
It would be interesting to show Brainfuck (bf) code rather than the actual email on the webpage. A lot of OCR systems struggle with these kinds of repeated symbols, where the exact count is required.
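For illustration, here is a minimal sketch of what generating such code could look like, reading "bf" as Brainfuck (an assumption). `to_bf` is a hypothetical helper that uses a single cell and only the `+`, `-` and `.` commands; `run_bf_subset` is a toy interpreter for just those three commands, included to check the round trip:

```python
def to_bf(text: str) -> str:
    """Encode text as Brainfuck using one cell and only + - . commands."""
    out = []
    current = 0  # value currently held in the single cell
    for ch in text:
        target = ord(ch)
        if target > 255:
            raise ValueError("only code points 0-255 are supported")
        delta = target - current
        # Step the cell up or down to the next character, then print it.
        out.append(("+" if delta > 0 else "-") * abs(delta) + ".")
        current = target
    return "".join(out)


def run_bf_subset(code: str) -> str:
    """Interpret the + - . subset of Brainfuck on a single 8-bit cell."""
    cell = 0
    out = []
    for op in code:
        if op == "+":
            cell = (cell + 1) % 256
        elif op == "-":
            cell = (cell - 1) % 256
        elif op == ".":
            out.append(chr(cell))
    return "".join(out)
```

The output is long runs of identical symbols, so one miscounted `+` or `-` changes that character and every character after it. A reader with a Brainfuck interpreter gets the address back exactly.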