|
|
|
|
|
by mystique
4124 days ago
|
|
Good list. I am new to Machine Learning with only ~1 year of real work and sometimes I slip and make one of these mistakes. I have a question on #7. I have not used the co-efficients to mean feature importance but some times get tempted to use them. How do you explain which factors are the most important factors behind some outcome to non-stat people? |
|
But the article gives two points why you shouldn't use coefficient values to determine feature importance, which I think are only valid to some extent.
>a) changing the scale of the variable changes the absolute value of the coefficient
and
>(b) if features are multi-collinear, coefficients can shift from one feature to others.
Regarding a), well, that's what standardized coefficients are for.
b) is a bit trickier, but most regression models are based on the assumption of non-collinearity. This is of course a problem with real-world data, because you will quite often find some level of collinearity. That's when you (1) test for this issue and (2) look towards multilevel models.