by dtjohnnyb 2667 days ago
There's an interesting discussion of this article over on the machine learning subreddit: https://www.reddit.com/r/MachineLearning/comments/awu86b/dis...

I particularly agree with the comment saying:

no software is capable of trawling through the bowels of the organisation to find out the correct interpretation of the `Extra2` field on the `Sales` table that takes three values: "TRUE", "Error" and null.

This, data cleaning, and understanding how best to store the data for better insight are the true bulk of data science work; very little of it is the shiny model-building work. I guess this is close to the "Industry specialist" outlined in the article, though.

1 comment

That more likely falls under the Data Engineering part.