Hacker News new | ask | show | jobs
by cmckn 2225 days ago
I don't think it's a crazy comparison, but AFAIK Presto doesn't have a shuffle stage, which is fundamental in traditional MapReduce. They're both distributed computations, that's the comparison I would make.

Edit: thinking about it more, how could Presto accomplish joins and other SQL operations without a shuffle? Seems very similar to Spark SQL, which is just syntactic sugar for multi-stage MapReduces.

1 comments

Presto can do shuffles when needed. But it has a smaller max size shuffle than Spark typically because everything has to stay in RAM