|
|
|
|
|
by fluidcruft
4153 days ago
|
|
> In other words, matrix factorization is the real deal. It’s the stuff that separates true data scientists from charlatans — the data alchemists, data phrenologists, and data astrologers of the world I certainly hope this is sarcasm. Matrix factorization is, like, the go-to tool for people that don't know anything about what they are studying (i.e. it's the phrenologist's favorite weapon). Factor a matrix, throw the results out there, slap on some perfunctory "discussion" that has no real mechanistic insight. Boom. Published. But maybe I'm describing "the stuff that separates true scientists from data scientists". Data science manifesto: The purpose of computing is numbers. |
|
Here are some examples:
Principal Component Analysis - SVD does dimensionality reduction where some n% of variance should be accounted for.
One layer Autoencoder - SVD done by a neural network
Latent Semantic Analysis - SVD on td-idf matrix we interrupt lower dimensions as having semantic importance
Matrix Factorization - SVD only now we interrupt lower dimensions as representing latent variables
Collaborative Filtering - SVD where we interrupt lower dimensions as representing latent variables AND we use a a distance measure to determine similarity.