|
|
|
|
|
by ACow_Adonis
2008 days ago
|
|
worked with stats, machine learning and data science for 10+ years now. never heard the term used until now. (that's not to say I'm not familiar with the things the term refers to, indeed, most of the intellectual frameworks I've worked with break each of the things that make up provenance into far more fine grained concepts). course, I've also never heard of or touched the software you listed there either, but that may be because I don't view the data science and machine learning I'm interested in as being about specific software or vendor software... sounds more database- lingo to me... |
|
"Provenance" just means where the data came from. [1]
It's one of those shibboleths and terms of art used by people in industry. If you go to trade-shows you'll hear it being used -- it's worth knowing if nothing else but for its sociological value among the data software tools crowd.
Side: it's a little like the word "inference" being used as a verb by folks in AI (example usage: we use GPUs to speed up model "inferencing") -- in AI, to inference means to "predict". It's a term of art. If someone with a traditional statistics background went to a deep learning conference, they are likely to be very confused because in traditional statistics, inference means to obtain parameters θ in a model y = f(x,θ), whereas in AI, inferencing refers to obtaining y.
[1] https://en.wikipedia.org/wiki/Data_lineage#Data_provenance