Hacker News new | ask | show | jobs
by cporios 2787 days ago
> With 85% accuracy, 85 of liars are flagged as lying (correctly), [...]

This is not what accuracy means.

85% accuracy just means that 85% of all the decisions the system makes are correct. A system in such a setting, where a single false negative matters a lot more than a single false positive (which would simply be handed over to a human for further investigation) would necessarily be tuned for extremely high recall at the cost of precision. In other words, it would often flag innocent people for further investigation (as you've said), but it would almost never clear people that should've been flagged.

2 comments

I am not correcting you, but simply illustrating how 85% accuracy tells us very little...

Let's make the spherical cow approximation that "a lie" is a fully defined concept, then we have 4 conditional (bayesian) probabilities:

P( "sincere" | sincere) The probability a sincere person is reported as "sincere".

P( "lying" | sincere) The probability a sincere person is reported as "lying".

P( "sincere" | lying) The probability a lying person is reported as "sincere".

P( "lying" | lying) The probability a lying person is reported as "lying".

The first 2 probabilities should sum to 1, and the latter 2 possibilities too, so we have 4-2 = 2 degrees of freedom. A reported "accuracy" tells us nothing without knowing the distribution of liars and sincere people in the test group..

Ah, yes, you're right. Thanks for correcting my misunderstanding.

https://en.wikipedia.org/wiki/Accuracy_and_precision#In_bina...

Ah, well, yes, looks like I was right, too, though.

Unless I'm mistaken (and that's possible, I've changed my opinion twice now), my example outlined above is

- conceivable, and

- has 85% accuracy (85 people correctly identified as liars, 85% x 9900 = 8415 correctly identified as non-liars, thus a total of 85+8415=8500 of 10k total "accurately" identified), and

- still only 5% or 6% of flagged liars are actual liars.

EDIT to add:

And if the system is tweaked as you suggest, to very rarely fail to flag a liar:

- suppose it correctly flags all 100 liars as liars

- suppose accuracy is still 85%, thus 8500 people in total classified correctly

- thus 8400 non-liars flagged correctly, and the remaining 1500 non-liars flagged incorrectly

Now still only 6.25% (100 of 1600) of people flagged as liars are actually liars. Thus, even with the tuning you suggest, this remains.

(Note to self: 1. think 2. write)

FWIW, I think you are totally correct if you take accuracy at face value.

You really have to compare precision and recall values to know if the accuracy statement holds true. You could have have 100% precision and low recall and still have 85% accuracy (meaning you could never flag someone as lying and be wrong while missing a bunch of liars and still have 85% accuracy).

but if everything is totally evenly distributed, then 85% accuracy means 85% accuracy and your first statement is correct.

The real issue is that accuracy is only one piece of the puzzle.