Hacker News new | ask | show | jobs
by beforeolives 1906 days ago
They aren't synonyms but they sometimes overlap.

- data engineering involves more work on data transformation and developing different pipelines

- data engineering requires more knowledge of databases, cloud environments or different streaming tools (it gets close to being a backend developer in some places)

- data engineering doesn't involve any statistical modeling, data science does

- data science is a broader term - depending on the company a data scientist might be doing all the data engineering work (if it isn't too much) + the model work and statistics. Or they might be focused entirely on research, statistics and ML models

1 comments

Thanks, that clarified it.

Do you know how people typically get into that role?

For us, depends on the seniority of the role, but we've had good luck bringing in people coming from both directions (where I define the "directions" as "software engineering" and "data science/analysts")

Analysts and junior data-science types can often make the transition well if they can beef up their engineering skills (i.e. learn to write tests, make stuff that will be maintained for years)

Software engineers are often a good fit too if they can pick up some of the data skills (get really good at SQL).

Probably really depends a lot on the specifics of the position, sometimes "data engineer" means "write sql queries to apply business rules" and sometimes it means "maintain our interesting in-house ETL applications which were written in Java 8 years ago"

I'd also value the soft skills a lot if I were to be hiring data engineers - so much of the job tends (at least where I am) to be correctly interpreting business rules/needs and anticipating potential future use-cases.

Hmm, so it might be out of reach for me. I have a PhD in pure math, and no experience as a software engineer. I've coded for research, but never for production.
You can move over from being a software engineer to a data engineer pretty easily. Or you can be a data scientist who had some exposure to that kind of work and move over to data engineering quite easily too.