by mindcrime 3318 days ago
Then what about the following sentence?

"This must be the case because the generalisation performance can vary significantly while they all remain unchanged."

Maybe it was just me, but I read an implied "alone" in the sentence you quoted, i.e.:

"Or in other words: the model, its size, hyperparameters, and the optimiser, alone, cannot explain the generalisation performance of state-of-the-art neural networks."