Hacker News
by occsceo 3838 days ago
I've been working on a project like this for some time now - and wrestling with whether to go with a community-based or a closed-source model.

The problems listed below are pretty accurate: huge data sets, lots of cleaning and normalizing, and the snail mail/CD problem is real. Additionally, I'd note that ~40% of the states [somehow] charge for the data...it costs six figures to get a snapshot of all 50 states - and certain states (looking at you, FL) say that they do not store historical data, meaning you have to connect with the local BoEs to aggregate the data.

A part of me [now] wants to open source this because of the DNC's actions.

3 comments

> A part of me [now] wants to open source this because of the DNC's actions.

Was my thought exactly. Aggregate, then open.

Would love to hear more and see if we can't collaborate on this. My email is seth AT amicushq DOT com.
I'll fire an email your way shortly. But yeah, a conversation would be great.
Not to pollute the thread, but I'm also really interested in this. I've worked at both the Clinton campaign and NGP VAN, and it seems like a very worthwhile pursuit. If you're adding people, my email is in my profile.
I've also wanted to get involved in an open voter file project like yours for a very long time. I'd love to connect as well. My email's my username at gmail, if you're interested.
If you open it up with addresses/phone/email-addresses, beware that the main users may be commercial marketers (i.e. junk mail senders), not campaigns. Also note that many states license the data with a restriction that it only be used for election purposes.
If I define open as "your campaign would have to register and be verified"...then it abides by the state/fed rules for these datasets. I can't just throw the data on GitHub.
Can you comment on this a bit?

It's not obvious to me how open access to this data would be intrinsically bad, but I suppose that relies on the assumption that it's equally available to all parties (which may not (definitely not?) be the case).

Any pointers to the statutory/regulatory guidelines would be appreciated.

The first question I ask people when talking about this project is, "Do you know your voter information is public?" About 85% are shocked and horrified that this information is available. Outside of a campaign, it is hard to say if the public would support it.

I've chatted with two different lawyers over here in Ohio...and both have advised strict security and election/campaign use only.

A good overview, yes. For example, NY says "Election purposes only", but fails to mention that each infraction is a misdemeanor. And Ohio is wrong. Ohio is campaign/election use only, also with the misdemeanor kicker.