|
|
|
|
|
by shawnz
3499 days ago
|
|
I am no math expert, but I have always thought about it like this. The squared error is like weighting the error by the error. This causes one big error to be more significant than many small errors, which is usually what you want. Am I on the right track? |
|
That's correct.
> which is usually what you want
Unless you have outliers, in which case it's what you don't want. So you add e.g. a Huber loss function to reach a compromise.