Hacker News new | ask | show | jobs
by mtgx 3864 days ago
No. It will never be effective. Here's why:

> Let's look at some numbers. We'll be optimistic -- we'll assume the system has a one in 100 false-positive rate (99 percent accurate), and a one in 1,000 false-negative rate (99.9 percent accurate). Assume 1 trillion possible indicators to sift through: that's about 10 events -- e-mails, phone calls, purchases, web destinations, whatever -- per person in the United States per day. Also assume that 10 of them are actually terrorists plotting.

This unrealistically accurate system will generate 1 billion false alarms for every real terrorist plot it uncovers. Every day of every year, the police will have to investigate 27 million potential plots in order to find the one real terrorist plot per month. Raise that false-positive accuracy to an absurd 99.9999 percent and you're still chasing 2,750 false alarms per day -- but that will inevitably raise your false negatives, and you're going to miss some of those 10 real plots.

This isn't anything new. In statistics, it's called the "base rate fallacy," and it applies in other domains as well. For example, even highly accurate medical tests are useless as diagnostic tools if the incidence of the disease is rare in the general population. Terrorist attacks are also rare, any "test" is going to result in an endless stream of false alarms.

This is exactly the sort of thing we saw with the NSA's eavesdropping program: the New York Times reported that the computers spat out thousands of tips per month. Every one of them turned out to be a false alarm.

And the cost was enormous -- not just for the FBI agents running around chasing dead-end leads instead of doing things that might actually make us safer, but also the cost in civil liberties. The fundamental freedoms that make our country the envy of the world are valuable, and not something that we should throw away lightly.

https://www.schneier.com/essays/archives/2005/03/why_data_mi...

1 comments

This is a good argument - but I don't entirely buy it - because a system like that does not need to lead the investigations, but can also be an additional source of information. A very powerful one - if the police is already investigating someone than the possibility to browse all his history, his contacts etc. would be very useful.
That's even worse - "Give me six lines written by the most honest man in the world, and I will find enough in them to hang him" - if you're looking for guilty-seeming people, you'll find them.

And if you're trying to prevent atrocities, you'll convict them before they've done anything wrong.