by occsceo
3838 days ago
I've been working on a project like this for some time now - and wrestling with whether I want to go with the community-based or the closed-source model. The problems listed below are pretty exact: huge data sets, lots of cleaning and normalizing, and the snail mail/CD problem is real. Additionally, I'd note that ~40% of the states [somehow] charge for the data...it takes six figures to get a snapshot of all 50 states - and certain states (looking at you, FL) say that they do not store historical data, meaning you have to connect with the local BoEs to aggregate it. A part of me [now] wants to open source this because of the DNC's actions.
|
Was my thought exactly. Aggregate, then open.