Hacker News new | ask | show | jobs
by scarecrowx 2981 days ago
We're using Spark on EMR with Data Pipeline to do ETL and to run Scheduled Jobs. Data pipelines terminates the cluster once ETL or job gets completed, helps us a lot to save cost.