Hacker News new | ask | show | jobs
by xrisk 2105 days ago
Is there a similar study done on the physical sciences? I’m getting a bit of holier-than-thou feeling from this article.

Edit: from all this talk of reproducibility, I wonder what percentage of cutting edge ML research is reproducible (either from lack of public training sets / not enough compute)

5 comments

There are definitely studies criticizing ML publications similarly. As a kind of statistics (but often without the rigor), screwups make ML methods appear better than they really are. Hence the literature is packed with screwups.

Other CS subfields that get a lot of criticism are "network science" and bioinformatics.

I've been playing with some ML lately. I'm sure some of this is because I only have a high level understanding of what I'm doing, but it really feels like throwing things at the problem and see what sticks and is fast. Add features, remove features, try different network architectures, different activation functions, etc. I think the best understood thing might be overfitting, which is oddly reassuring.
> ...throwing things at the problem and see what sticks and is fast.

That generally isn't enough to get published nowadays though, at least in a simple sense (in a broader sense that process might describe all research, of course). To get published requires some deeper demonstration of a new kind of method that not only works, but is superior to all that preceded it in some important way. In other words you show your new method compared to other methods, where yours must be better. Obviously, bad research here can show a supposed advantage by either doing a unfair job applying competing methods, or overfitting their new methods. Or both. As I understand it, the first one is quite common: comparing a poorly-tuned old method to a carefully-tuned new method.

Well yes. I remember hearing that there are now meta-networks, networks that optimize other networks by mechanisms you described.
ML has the benefit though of a rapid turnaround time from Academia -> Industry. Things that work/replicate will be immediately used to make money. Things that don't work will be abandoned pretty quickly (at least outside of ML researchers).
There are tons of replication issues across the sciences, they are just most salient in the Social Sciences because the topic is just really hard to study well.

Clinical trials can often be flawed, even if the stats are fine, just in how they sample. For example, women are often excluded from trials due to hormonal changes, but how drugs impact women is really important! Participants are also typically drawn from specific locations, and so may not be representative of people with different diets, lifestyles, and environmental factors.

Physics has its own controversies, though not always directly related to replication. For example, Harry Collins recounts the social factors involved in the discovery of gravitational waves: https://blogs.sciencemag.org/books/2017/03/28/harry-collins-...

Biological sciences are more often than not just as difficult to reproduce, mostly due to the difficulty of controlling living organisms, the somewhat random nature of the outcome, and p-hacking.
He mentions that epidemiology has actually more severe problems than economics. Having read some epi papers I understand why. Not sure if you'd count that as a physical or social science though: at least theoretically it's biologically based, but in reality the data it works with is mostly social and demographic.