|
|
|
|
|
by get
3044 days ago
|
|
TLDR: Because it gives more weight to one big error then to multiple small ones with the same sum. We want the errors to be noise and not systematic. Noise usually has a gaussian distribution. And in a gaussian distribution multiple small values are more likely than one big one. |
|
Imagine these two predictors:
So Predic1 was better then Predic2? No. Because correctly predicting the one outlier shows more predictive power then staying close to the average. Therefore we use SumOfSquerrors: This shows that Predic2 is "better" and we are happy :)