|
|
|
|
|
by joshuamorton
1767 days ago
|
|
What you need are more parameters than the complexity of the underlying distribution. If you drop to a linear function you're modelling, you only need a couple of parameters. "Overfitting" is memorizing the training data instead of generalizing. The example you're providing isn't overfitting, it's just generalizing to the wrong function. Overfitting would be if the validation set was, say, 30 random values that you got right, but didn't get other values along the same lines correct. > I didn't specify the training set, just the test set Then unless you constructed the training set with the intent of mistraining the model, I think a training set that got good accuracy on that validation set would generalize. > The point is that it doesn't require trillions of parameters to overfit to a trillion-sized test set. You can't "overfit" a validation set, unless you've done something wrong. Overfitting is, by definition, learning the training set too well such that you fail to generalize to a validation set. |
|