| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by dunefox 1438 days ago
	There really is a lot of cherry picking, etc. going on in this area. Papers released without code and weights or even data make reproduction and validation nearly impossible.

2 comments

forgingahead 1438 days ago

Yeah it's stunning to me that people can apparently run experiments with code, produce results with code, write a paper about that code, and then release poorly written prose in a garbled way without also releasing that code (or at the very least releasing a video demonstrating results).

link

kfarr 1438 days ago

...And get a PHD for it!

link

the_only_law 1438 days ago

I was under the impression this is just how academia worked nowadays.

link

harha 1438 days ago

There are really just two situations if the solution generalizes well - and if it doesn’t it might be worth just mentioning that and to move on:

1. similar open data exists, great, just publish a sample implementation

2. if not the first task is to generate such an open data set

Edit: formatting

link

time_to_smile 1438 days ago

There's also the problem that most complex neural networks are highly sensitive to initial weights. My friends and I have frequently tried to reproduce famous papers and it's remarkable how often getting the initial settings nearly exactly correct is the key to achieving the targeted bench mark.

This is a problem because cherry picking is essentially built into the frame work.

If I was building ranking algorithm and just kept picking a random seed to arbitrarily sort a list of numbers until it was correct, most people would consider that obviously cheating. However if I did the same thing but stuck 3 dense matrices between the seed and the list to be ranked it would considered AI.

link