by rahulj51 1983 days ago
Cost will definitely be a factor, but if it isn't the deciding one, I'd suggest Redshift because of its simplicity. Another aspect is how the data will be consumed: analysts are typically more comfortable with SQL, which favors Redshift. I'd avoid Spark except for very specific use cases, as the operational toil is higher with Spark.
1 comment

The consumption is very SQL-based. "operational toil is higher with Spark" <- I agree with this, which is why I'm leaning towards Redshift. Throwing away half the codebase in favor of some SQL statements is tempting.