Hacker News new | ask | show | jobs
by mtrn 2868 days ago
Luigi is a great Python library for building multi-stage pipelines with parameterized tasks. It easily extend to new storage types and targets.

We currently use it to build a moderately complex product from dozens of data streams (files, APIs and things in-between) and millions of records. At its core is a DAG and topological sort, which capture the essence of pipelines and execution.

A DAG is easy to visualize, even its development over time[1].

[1] https://imgur.com/a/RuC0B5Y (generated from the projects' code, in this case https://github.com/miku/siskin)