Hacker News new | ask | show | jobs
by kylediaz 962 days ago
I actually did a more empirical approach to this problem recently. https://kylediaz.com/post/scraping-emails-hackernews/#llm-sc... I had similar results. Email obfuscation like name [at] domain [dot] com is trivially found by both regex and LLMs, but emails like name@[my domain] or name(delete me)@domain.com are harder for LLMs. LLMs could find those emails, but wouldn't try to obfuscate it no matter how I prompted it. It's probably a skill issue on my part, and I'll have to try that neat "think step-by-step" trick.

IMO, the best and easiest way to obfuscate is using invisible HTML elements: name<span style="display: none">you can't see this</span>@domain.com It's technically scrapable (as I show in my post), but it gives way more "security" than [at]/[dot] while still allowing users to just copy/paste it.