|
|
|
|
|
by chosenken
3203 days ago
|
|
We use airflow mainly for ETL work, backfills, and batch processing. We do a lot of work with clickstream type data, be it taking data from analytics.js and loading it into redshift, or taking analytic data from redshift and loading into Google analytics. We have developed many pipes and connectors for airflow that allow us to connect to many data sources, both at the source and sink ends. I mainly work on the DevOps team, running our infrastructure and working on back end systems, so my knowledge of airflow is more high level. I just keep the system up so it can run ;P FYI I work for a startup out of Cincinnati, OH called Astronomer, you can find us at https://www.astronomer.io Also, we weren't like all down, we just saw lots of time out issues when reading/writing to S3. |
|
Very interesting re redshift => google analytics though, I've never heard of it done in that direction.
Do you think airflow is suitable for one-off/cron task management as well?