Hacker News new | ask | show | jobs
by spacephysics 803 days ago
As I understand, you’d need to know all the confounding factors to account for variations?

What about multi-variate problems? For instance many studies used red meat intake in the form of burgers from fast food as “red meat” full stop, and concluded that causes cancer. Not really paying attention to the seed oils or mountains of other ingredients in a SAD which are banned in other places yet allowed here. Not including the problems with pesticide use.

I really don’t trust meta analysis or statistical gymnastics to draw causal conclusions from nutrition studies given the wide array of factors that we really can’t account for. Genetics of the individual, environmental history (high protein with someone who has a history of kidney disease is bad vs an average person for example)

I’m more concerned over pesticides, preservatives, and even storage containers produced in the last 75 years to cause cancer and other issues than a source of food that we’ve and many others have lived off for some thousands of years. Plants and meat included.

Unfortunately there’s a mistrust in many of these institutions, but I think for good reason. They need to get their act together (see sugar industry’s collusion with scientist that asserted saturated fats cause heart disease). Not to mention the enormous p-hacking and reproducibility crisis.

2 comments

There’s a recent twin study from Stanford where they took a bunch of identical twin, and gave one an omnivorous diet and the other a plant-based diet. Both were also supposed to follow an exercise routine. For the first several weeks they had pre-prepared healthy meals, before switching to individuals preparing their own meals with guidance/advice.

Vitals were taken and blood work was done before making the diet/lifestyle changes, and again after.

> As I understand, you’d need to know all the confounding factors to account for variations?

Not necessarily, the idea of these methods is to basically find natural ways to group your study pop such that confounders are matched 'automatically'. Regression discontinuity analysis, for example, should handle most confounders outside of some egregious edge cases. The idea there is find a continuous variable which thresholds people into receiving the treatment or not, and then limit your analysis to only individuals on 'the cusp' of falling above or below the cutoff: idea is that these are people who basically fell into or out of the treatment due to random chance, so they form a quasi-experimental population. If this assumption is satisfied, then the confounders will be handled as if treatments were assigned truly randomly (classic application of this method is college scholarships assigned based on SAT scores or GPA). It's hard to speak super generally about the efficacy of these methods, because it does hinge strongly on the particular assumptions of each method being satisfied, but I will say that as a whole they're very well attested to in the professional stats literature and have a long and respected history in ex. econometrics.

> I really don’t trust meta analysis or statistical gymnastics to draw causal conclusions from nutrition studies given the wide array of factors that we really can’t account for.

That's fair, again I can't really cosign any of the results in the world of nutrition because I have no experience with that field; just wanted to chime in with some additional context about stats methods. I will say that, if properly done, the interpretation of a causal inference technique should be pretty straight up, they're not like some black box method. For example the discontinuity regression I mentioned above is simple enough conceptually that I think it's actually pretty strong evidence, even it seems like 'gymnastics' at first glance. However you are of course free to draw whatever conclusions you like regarding the rigor of the analysis or the propriety of the methods.

> They need to get their act together

No doubt, I would actually point to causal inference methods as a very promising avenue for the field to improve in this way (if they haven't embraced them already). Generally dealing with questions of causality explicitly in your methods (IMO) makes it way harder to just p-hack your way to a conclusion; you have to actually think through the mechanism of action and how it's represented in your model.

edit: Forgot to address one question:

> What about multi-variate problems?

Methods definitely exist for this, but in general multivariate stats is just harder than single variate stuff. Seems like the example about red meat vs. seed oils is more of an omitted variable concern than what I'd usually refer to as a multivariate question, and to this I'd again point out that if a quasi-experimental method is properly employed confounders should be handled (though this presumes a bunch of assumptions are met, and ideally the analyst should apply some diagnostics to check that they are).