Hacker News new | ask | show | jobs
by XCSme 133 days ago
AI traffic is really tricky because traditional analytics and SEO tools just don’t capture it. Even basic UTMs often miss the "last hop" behavior from these systems, so it can feel like you have no idea how users actually reach your pages. One thing that helped us was using a self-hosted analytics setup [0] with session recordings to understand user flows coming from AI referrals. You get visibility into real behavior without sending data to third-party services.

The drawback though, is that just gives you MORE insights into the human traffic, and AI still looks just like spam/robot crawlers.

[0] https://www.uxwizz.com

1 comments

Distinguishing 'AI Research' (crawling) from 'AI Referral' (user clicks) is the hardest part. Most agents (OAI-SearchBot, ClaudeBot) declare themselves in UA, but the actual click-through often strips the referrer or shows as direct/none. We've had some luck correlating 'time of crawl' with 'time of visit' to fingerprint AI traffic, but it's noisy.

Self-hosted is definitely the way to go for raw logs though. GA4 obfuscates too much of this.