Hacker News new | ask | show | jobs
by bunderbunder 2566 days ago
Well, some of the issue there is that it's really hard to get a good analysis out of bad or unsuitable data. Garbage in, garbage out.

Generally it's better to put the horse in front of the cart: Figure out what kinds of questions you want to be answering, and then design a way to collect the data you need to answer those questions.

This isn't far off from the lesson that medical science somewhat recently had to learn the hard way, about how just dragnet collecting heaps data and then figuring out what to do with it after the fact will yield far more incorrect conclusions than correct ones.