Y
Hacker News
new
|
ask
|
show
|
jobs
by
robotswantdata
82 days ago
How does that work if the scraper takes a screenshot to feed to a LLM or OCR?
2 comments
yummypaint
82 days ago
That seems like a very expensive way to crawl the internet
link
robotswantdata
82 days ago
Scrape normally collect emails, if no email seen take screenshot and OCR OCR is cheap and REGEX is cheap
link
woctordho
82 days ago
It would be interesting to show bf code rather than the actual email on the webpage. A lot of OCR systems struggle with this kind of repeated symbols where the exact count is required.
link