|
|
|
|
|
by rathboma
3668 days ago
|
|
Using Sqoop from something like Luigi as the ETL manager is a pretty great workflow - https://github.com/spotify/luigi You can define dependencies between jobs based on output file which allows you to re-run only part of your pipeline |
|