by mistercow 4899 days ago
>To have 2 of them (that we know about) stemming from prosecutions by one individual, I would have to say, is pretty remarkable. While it could just be a coincidence, the one common denominator in these two statistically rare incidents is Stephen Heymann.

Careful. You are subtly reusing evidence in an invalid way.

The reason that Heymann's record is under examination in the first place is that Swartz committed suicide while Heymann was prosecuting him. That's the observation that generates the hypothesis "Heymann is more aggressive than the average prosecutor, and this leads to an increase in suicides". So then we test that hypothesis by looking at Heymann's record and seeing if we find more suicides.

So far so good, but what we can't do is then reuse Swartz's suicide to support that hypothesis. Appropriately enough, this is an example of the prosecutor's fallacy (http://en.wikipedia.org/wiki/Prosecutor%27s_fallacy). The key is that we have to keep an eye on the context of the original hypothesis. If someone had looked at Heymann's methods and said "Wow, that's going to lead to people killing themselves", and then looked into his record and said "Willikers! Get a load of this body count!", then they'd be right to count both Swartz' and James' suicides.

But that's not how we got here, so the question has to be "Is the number of suicidal defendants prosecuted by Heymann — other than Swartz — significantly out of line with expectations?"

And that number is one. To answer the question would require some data. I've tried gathering data from different angles, but I'm not sure it's out there. Maybe someone else can take a whack at it.
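
For anyone who does turn up the data, here is a minimal sketch of the test I have in mind. Every number in it is a made-up placeholder, not a real statistic: `n_other_defendants` stands in for Heymann's caseload excluding Swartz, and `base_rate` for the per-defendant suicide rate among federal defendants in general. The helper `binomial_tail` is just a name I picked for illustration.

```python
from math import comb


def binomial_tail(n: int, k: int, p: float) -> float:
    """Return P(X >= k) for X ~ Binomial(n, p), computed via the complement."""
    below_k = sum(comb(n, i) * p**i * (1 - p) ** (n - i) for i in range(k))
    return 1.0 - below_k


# HYPOTHETICAL placeholders. Replace with real figures if they can be found.
n_other_defendants = 1000    # assumed caseload, with Swartz excluded
base_rate = 0.001            # assumed suicide rate per defendant in general
observed_other_suicides = 1  # James; Swartz is the trigger, so he is left out

p_value = binomial_tail(n_other_defendants, observed_other_suicides, base_rate)
print(f"P(at least {observed_other_suicides} suicide(s) by chance) = {p_value:.3f}")
```

A small value would mean that one suicide among the other defendants is already surprising under the base rate; a large value would mean it is about what you'd expect from any prosecutor with that caseload.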

2 comments

The point of the prosecutor's fallacy isn't that you should ignore evidence; it's that you must include all the evidence in calculations of probability. Swartz' suicide is inductive evidence for the hypothesis "Steve Heymann caused Jonathan James' suicide"; you can't just ignore it.
I'm not saying to ignore it. I'm saying not to double-count it.

>Swartz' suicide is inductive evidence for the hypothesis "Steve Heymann caused Jonathan James' suicide",

That's actually a separate hypothesis.

Can you describe the point at which double counting is occurring? My hypothesis is, "Steve Heymann's defendants are more likely than others to commit suicide". I propose to determine with what likelihood a criminal defendant will commit suicide, determine with what likelihood Steve Heymann's defendants have historically committed suicide, and compare the two numbers. How does excluding Aaron Swartz from consideration make my conclusion better reflect reality?
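
One way to see where the double counting enters is a quick simulation. This is only a sketch under assumptions of my own: every simulated prosecutor is identical (so there is no real effect to find), `caseload` and `base_rate` are invented numbers, and a prosecutor only gets examined when one designated high-profile defendant (index 0 here) commits suicide, which mirrors how Heymann's record came under scrutiny.

```python
import random


def simulate(num_prosecutors: int, caseload: int, base_rate: float, seed: int = 0):
    """Compare suicide rates with and without the event that triggered scrutiny.

    All prosecutors share the same base_rate, so any apparent excess among
    the examined ones comes purely from how they were selected.
    """
    rng = random.Random(seed)
    examined = 0
    total_with_trigger = 0
    total_without_trigger = 0
    for _ in range(num_prosecutors):
        outcomes = [rng.random() < base_rate for _ in range(caseload)]
        # We only look at a prosecutor's record if the high-profile
        # defendant (index 0) committed suicide.
        if not outcomes[0]:
            continue
        examined += 1
        total_with_trigger += sum(outcomes)
        total_without_trigger += sum(outcomes[1:])
    if examined == 0:
        raise ValueError("no prosecutor was selected; raise num_prosecutors")
    rate_with = total_with_trigger / (examined * caseload)
    rate_without = total_without_trigger / (examined * (caseload - 1))
    return rate_with, rate_without


# Hypothetical parameters chosen only to make the effect visible.
rate_with, rate_without = simulate(num_prosecutors=20000, caseload=100, base_rate=0.02)
print(f"rate counting the trigger:  {rate_with:.4f}")
print(f"rate excluding the trigger: {rate_without:.4f}")
```

Under these assumptions the rate that counts the triggering suicide comes out above the true base rate even though no prosecutor is any different from the rest, while the rate that excludes it lands close to the base rate. That gap is the double counting: the trigger is guaranteed to be a suicide by the selection rule, so it tells you nothing further about the prosecutor.
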
In a court of law, or over a dinner with a mathematician, you are absolutely right.

In real terms -- in getting the prosecutor removed from office, in stopping this kind of mindless overuse of federal reach -- it is best to count both, and use the nickname 'Suicide Steve'.

I think it warrants investigation. If we have sufficient outside evidence to support the hypothesis, then of course go after him. But if the truth is that this is par for the course, and all prosecutors are driving defendants to suicide, then it would be very bad to crucify this one guy. Nothing stops progress like a good scapegoat.