Hacker News new | ask | show | jobs
by shoto_io 1881 days ago
Hi Thomas, thanks for your work. I am a fan!

Just a minor comment: I find using decimals places (like 16.1% and 21.1%) in human experiments pretty irritating. It feels like false precision.

After all, these experiments must have confidence intervals. If I had to guess, I’d assume at least a +/- 5 ppts variability in all these numbers.

What’s your view on that?

2 comments

Thank you so much!

I agree that the figures can vary for many reasons and we shouldn't expect them to be exactly the same (some things we don't end up controlling for).

At the same time, if we take the experiment of the soup for example:

They measured 9,227 sales of it so the 21.1% increase is quite robust and I'd expect the error margin to be much lower than 5% either way - so in some ways the precision is warranted.

I also feel that if I were to round a 21.4% to 20% I'd be miscommunicating the findings of the research :)

You are confusing confidence intervals (used to say that you are confident the increase is positive at all) with error margin (the false precision in the 3rd significant figure).
You are right, my bad! Thanks for catching that in the comment above, I've updated it
That does not bode well for the rest of the methodology...
Mixing up terms in casual conversation happens to everyone, and is not a sign of a meaningful problem.
I don't know why people downvote you:

//> we finally have an absolute number of sales measured, but no way of knowing - representing all sales within a period or just cherry picked?

//> for the rest of the population, did it reduced sales?

//> was there any randomised test or not, because in the latter case there could be other biases we're unaware of

//> the increase is compared to what exactly?

//> any WHY is purely speculative as what was measured was WHAT people did. Internal motivation is in this case unproven, there is just a potential correlation.

//> confuses me to hell people taking about error margins and confidence intervals for something measured directly.

"if I were to round a 21.4% to 20%"

Right, but you'd round 21.4% to 21%, not to 20%.

You're assuming that you can only round to the nearest 1% which is not true at all. If you round to the nearest confidence interval and the CI is 5% then you would round to 20%. That said, I would prefer both the precise number and CI communicated together like sibling comment mentions.
Or ideally keep 21.4% and include the 95% CI. There’s nothing misleading about the extra precision when the CI is included.
Agreed. Actually, I wish the norm was to communicate percentages with confidence intervals by default, because I feel like the tacit implication is colloquially "100% CI unless communicated otherwise."
Thanks for explaining! Your point about miscommunication makes a lot sense to me!
> I find using decimals places (like 16.1% and 21.1%) in human experiments pretty irritating. It feels like false precision.

Whether you say 21.1%, 21%, or 20%, you still have a single number. You could make an argument that decimal places like 21.147258% add clutter, but without an actual measure of uncertainty, all you're doing is reporting a summary of the data in the sample with different amounts of arbitrary rounding. That's not particularly helpful as a substitute for the full distribution.

It's not just a number. It's a string of characters conveying information about a measured value. The way it's specified conveys information about the number of significant figures (https://en.wikipedia.org/wiki/Significant_figures).
Significant figures are not a universal standard, they are an imperfect way to in-band information about confidence, by sacrificing precision.
21.1% is implying a different confidence interval than 21%. At least to me.
Yup, 21% and 21.0% do implicitly convey different information even though they’re the same number.