|
|
|
|
|
by nl
2139 days ago
|
|
Not sure what you think this shows, but there's a lot of reasons why the results they show don't really matter much - or at least might actually reflect an accuracy approximatly what a human would also achieve. Their headline claim is a "1 pixel change reduces accuracy by 30%". The test process for that number is this: We choose a random square within the original image and resize the square to be 224x224. The size and location of the square are chosen randomly according to the distribution described in (Szegedy et al., 2015). We then shift that square by one pixel diagonally to create a second image that differs from the first one by translation by a single pixel. So... they are taking a random square, downsampling to 224, moving and then predicting on that subset of the original image, and measuring the performance against the accuracy of the original prediction. What this seems to show is that "CNNs aren't as accurate at making predictions on subsets of an image as on the whole image". This is of course to be expected, and is exactly how a human would perform. There are a bunch of other criticisms too. Read the ICLR (for which it was rejected) reviews: https://openreview.net/forum?id=HJxYwiC5tm |
|