Hacker News new | ask | show | jobs
by nvalis 1528 days ago
Some comments from an ATLAS physicist doing W mass measurements at the LHC:

https://non-trivial-solution.blogspot.com/2022/04/do-we-have...

4 comments

As another ATLAS physicist, I can say that this is an excellent article from Prof. Schott. He is very politely arguing that "someone messed up". I'm not sure I agree so much with the point of combining the LEP experiments, which do have some tension with each other. Unless the combination is specifically taking into account correlations between uncertainties at the different experiments on the same collider (which exist, but it's really hard to handle).

Another take many people in the field are expressing is that it's simply infeasible to reliably interpret statistical models at that level (especially one that is dominated by systematic uncertainty), since they are based on approximations and assumptions e.g. that certain nuisance parameters are "nicely" distributed and uncorrelated. See e.g. comments from Prof. Cranmer [1] who is one of the folks who developed the standard statistical formalism and methods used in modern particle physics experiments.

[1] https://twitter.com/kylecranmer/status/1512222463094140937?s...

Why don't people use nonparametric methods to get around the problem of assuming certain parameters are "nicely" distributed? (Not a physicist, but curious – this seems like the "obvious" solution.)
Nonparametric methods are often used when the assumptions of parametric tests don't hold.

In physics experiments they want to fix the structure of the model and know the assumptions. They want to know the distribution and parameters to hold. If assumptions don't hold, they must find out why, find better assumptions and fix the model.

To say it differently: physicists are not trying to discover statistical laws. They are trying to discover physical laws trough statistics.

It sounds like they know certain assumptions regarding parameters that are not of interest are wrong. So why explicitly model those, if we don't care about their distribution? We (apparently) only care about an accurate estimate of W boson mass.
That works if the thing is something you can remove from the experiment and model separately, then put it back. In CERN many variables are tied to this one huge machine that is one of its kind.
I admit I'm not familiar with the model used to aggregate the boson data. But there's an entire community of nonparametric/semiparametric statisticians that works on problems just like this. It seems crazy to me that that millions of dollars are spent to build the machines to collect this data, yet the papers are written using statistical models with distributional/independence assumptions known to be false. (The tweet linked above seems to be saying something similar.)

Is there a concrete reason we can't be naive and just bootstrap confidence intervals for example? Of course I defer to the physicists here – but I'm curious whether there's some simple high-level reason the usual tricks don't work.

Because it’s all interrelated and too many variables make it impossible to nail down anything with certainty if you don’t assume some invariants somewhere?
> We observed for quite some time some features in the our data, which we could not explain. Once one of my PhD students came into my office and told that he finally figured out this feature: the protons in the ATLAS detector do not collide heads-on but under a very small angle, allowing the not interacting protons to continue their travel through the LHC on the other side of the experiment. Indeed he was right - we have not been considering this effect in our simulations, however - after some calculations and speaking to the machine experts - it turned out that this effect induces a feature in our data, which is opposite in sign that we observe; so we have been left with an effect that was twice as large and unexplained. In the end it turned out to be caused by the deformation of the ATLAS detector by its own weight of more than 7000 tons over time.

I know these people are incredibly smart and conscientious. And the standard model is extremely successful and well confirmed. But that's a lot of degrees of freedom.

"I do not think, we have to discuss which new physics could explain the discrepancy between CDF and the Standard Model - we first have to understand, why the CDF measurement is in strong tension with all others."

That's... cute. I doubt it will stop the theorists from flooding the arxiv with explanaitions in the coming days/weeks. Recall what happened when there was a barely 3 sigma (local) statistical fluctuation in LHC data:

https://resonaances.blogspot.com/2016/06/game-of-thrones-750...

Edit: Thank you for posting the excellent article!

Waiting to see if Tammaso puts something up about it, IIRC he was a CDF member

https://www.science20.com/quantum_diaries_survivor