|
|
|
|
|
by paulfharrison
11 days ago
|
|
It's nice to see sparse interpretable LLMs being made. This is similar to factor rotation in factor analysis (or PCA). A varimax rotation, for example, can produce an equivalent factor analysis with sparse loadings, and which is generally more interpretable. Fortunately for us the world is not just a complete mess, and sparse loadings can often be found. There seem to be "natural" concepts that we have observed rather than invented. (Many examples in other simple machine learning methods too, I am sure.) |
|