|
|
|
|
|
by carlosf
1903 days ago
|
|
I disagree, every ML model has some implicit statistical assumption, which is often not well understood by practitioners. At minimum you must assume your underlying process is not fat tailed. If it is, then your training/validation/test data might never be enough to make reliable predictions and your model might break constantly in prod. BTW shifting distributions and fat tailed distributions are sort of equivalent, at least mathematically. |
|
In some cases if you care about PAC generalization bounds, it's even the case that the bounds do actually hold for all possible distributions.