by gwern 3133 days ago
Subsetting will also increase the variance of each datapoint (consider the extreme case of picking a subset which gets only 0 or 1 visits per day), so it is probably not a win. It's also hard to imagine what subset properly reflects all sources of traffic and so is informative about the total effect of advertising. Search queries definitely are not it.
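
To make the variance point concrete, here is a rough simulation sketch. It assumes daily visits are roughly binomial and reads "variance" as noise relative to the mean (sd / mean); both are my assumptions for illustration. The function name `relative_noise` and all the numbers (2000 potential visitors, 200 days, a 0.1% subset) are made up, not real traffic data.

```python
import random
import statistics


def relative_noise(keep_prob, days=200, visitors=2000, visit_prob=0.5, seed=0):
    """Coefficient of variation (sd / mean) of daily visit counts in a subset.

    Each potential visitor shows up with probability visit_prob, and each
    visit falls into the subset with probability keep_prob. All parameter
    values are hypothetical illustration values.
    """
    rng = random.Random(seed)
    daily_counts = []
    for _ in range(days):
        count = sum(
            1
            for _ in range(visitors)
            if rng.random() < visit_prob and rng.random() < keep_prob
        )
        daily_counts.append(count)
    mean = statistics.mean(daily_counts)
    if mean == 0:
        # No visits at all in the subset: the ratio is undefined, treat as infinite.
        return float("inf")
    return statistics.pstdev(daily_counts) / mean


full = relative_noise(keep_prob=1.0)    # all traffic, about 1000 visits/day
tiny = relative_noise(keep_prob=0.001)  # subset with about 1 visit/day
print(f"all traffic: sd/mean = {full:.3f}")
print(f"0.1% subset: sd/mean = {tiny:.3f}")
```

Under these assumptions the tiny subset's day-to-day noise is about as large as its mean, so a modest advertising effect would be much harder to see there than in the total.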