| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by Peteris 2613 days ago

Kedro puts emphasis on seamless transition to prod without jeopardizing work in experimentation stage:

- pipeline syntax is absolutely minimal (even supporting lambdas for simple transitions), inspired by the Clojure library core.graph https://github.com/plumatic/plumbing

- sequential and parallel runners are built-in (don't have to rely on Airflow)

- io provides wrappers for existing familiar data sources, but directly borrows arguments from Pandas, Spark APIs so no new API to learn

- flexibility in the sense you could rip out anything, for example, the whole Data Catalog replacing with another mechanism for data access like Haxl

- there's a project template which serves as a framework with built-in conventions from 50+ analytics engagements