Hacker News
by tonyhb 3338 days ago
> Reproducibility is a big problem in analytic pipelines, specifically genomics... [Docker solves this problem].

There is a legitimate, fundamental issue with reproducing people's setups, and Docker solves it. Ergo, Docker is valuable technology for data science.

Literally typing `docker run...` to reproduce results (vs. VM setup, installation, etc.) seems important enough to justify their phrasing.

Fact: there are handfuls of data science startups that essentially rebrand Docker and have been funded in the hundreds of millions of dollars specifically to solve this problem.