Hacker News new | ask | show | jobs
by CytokineStorm 5940 days ago
How is one supposed to run any sort of machine learning algorithm with only two seasons of data? I could understand throwing the stats from the last 15-20 seasons into Weka and seeing what it said about 2010, but seriously how useful is only 2 seasons worth of data going to be?
2 comments

The data there has the scores from ~5000 games played over the course of each season, and the model he links to also seems quite reasonable to me: http://blog.smellthedata.com/2009/03/data-driven-march-madne...

Don't think of it as two data points. Think of it as two data sets.

And being college teams, their ratings can change drastically over the course of more than a year or two...