by p_sherman 5073 days ago
Remember when data mining used to mean 3 000 000 records....
1 comment

In some ways, a smaller data set is harder to deal with than a larger one. For example, the surprising effectiveness of Bayesian statistical analysis is mostly due to the huge amount of data you can throw at it: with few records, the prior and plain sampling noise still dominate the answer.
On the other hand, with 300 records you can grab a pad and a pencil, and manually tick blue, yellow, blue...
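
To put rough numbers on that, here is a minimal sketch using a Beta-Binomial model, which is just one simple way to do the Bayesian version of the pencil tally. The 60% "blue" share, the uniform Beta(1, 1) prior, and the function name are all made up for illustration; nothing here comes from a real data set.

```python
import math

def beta_posterior_sd(successes, trials, prior_a=1.0, prior_b=1.0):
    """Posterior standard deviation of a proportion under a Beta prior.

    Assumes a Beta(prior_a, prior_b) prior and binomial data; the
    defaults give a uniform prior. Hypothetical helper for illustration.
    """
    a = prior_a + successes
    b = prior_b + (trials - successes)
    # Variance of Beta(a, b) is a*b / ((a+b)^2 * (a+b+1)).
    return math.sqrt(a * b / ((a + b) ** 2 * (a + b + 1)))

# Hypothetical tallies: 60% of records are "blue" in both cases.
small = beta_posterior_sd(180, 300)
large = beta_posterior_sd(1_800_000, 3_000_000)

print(f"300 records:       posterior sd = {small:.4f}")
print(f"3 000 000 records: posterior sd = {large:.6f}")
```

With 10 000 times more records the uncertainty shrinks by roughly a factor of 100 (the square root of 10 000), so at 300 records the estimate is still off by a few percentage points either way, while at 3 000 000 it is pinned down to a small fraction of a point.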