Hacker News new | ask | show | jobs
by codingslave 2160 days ago
Just recently implemented connected components in a spark job that runs on a 100TB of data.
1 comments

How long did DFS take? In my experience, with Spark it's like factor of 100 slower than what one would expect on a single machine and a sizable dataset.