|
|
|
|
|
by srean
3549 days ago
|
|
> But if your model is sufficiently expressive you don't need to explicitly build or model the corruption process This is the claim that I am having trouble with. Say I have two random variable X,Y with some joint distribution. If a corruption process can mess with the samples drawn from it, I cannot see how it could possibly recover either the joint or the conditional. Are you saying that the corruption is benign like missing at random or missing completely at random ? Then its much more believable. |
|
Now the question arises; what if things are more complex?
In real life they always are; both your biasing factor and the rest of the model. So we've cooked up all sorts of fun models like SVMs, random forests and neural networks to analyze such complicated models and find hidden features and relations that we didn't think of. Bias is one such feature.
If I built an algorithm that learned to display different ads to mobile and desktop people (i.e., treat mobile "time on site" differently from desktop "time on site"), would you be surprised by this?