by jey
4402 days ago
> Maybe Spark doesn't like 100x growth in the size of an RDD using flatMap? I'd be interested to hear more about your use case and the problems you encountered. It's possible that you need to do some kind of .coalesce() operation to rebalance the partitions if you have unbalanced partition sizes.
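
To make the rebalancing suggestion concrete, here is a minimal sketch in Scala. The input data, the expansion factor, the partition counts, and the object name are all made up for illustration, and it assumes a local Spark installation. One caveat worth knowing: a plain `coalesce(n)` only merges existing partitions, so it may not even out skew by itself. `repartition(n)`, which is `coalesce(n, shuffle = true)`, does a full shuffle and is usually what evens out partition sizes.

```scala
import org.apache.spark.{SparkConf, SparkContext}

// Hypothetical example object; not taken from the original discussion.
object RebalanceAfterFlatMap {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("rebalance-sketch")
      .setMaster("local[4]") // assumption: run locally with 4 threads
    val sc = new SparkContext(conf)
    try {
      // Made-up input: 8 records spread over 4 partitions.
      val input = sc.parallelize(1 to 8, numSlices = 4)

      // Uneven expansion: record n emits n * 100 outputs, so partitions
      // holding larger values grow much more than the others.
      val expanded = input.flatMap(n => Seq.fill(n * 100)(n))

      // Count records per partition before rebalancing.
      val before = expanded.mapPartitions(it => Iterator(it.size)).collect()
      println(s"before: ${before.mkString(", ")}")

      // Full shuffle that redistributes records across 4 partitions.
      val rebalanced = expanded.repartition(4)

      // Count records per partition after rebalancing.
      val after = rebalanced.mapPartitions(it => Iterator(it.size)).collect()
      println(s"after:  ${after.mkString(", ")}")
    } finally {
      sc.stop()
    }
  }
}
```

Comparing the two printed lines should show whether the skew introduced by `flatMap` was the issue. The shuffle has its own cost, so it is probably only worth it when the per-partition counts differ a lot.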