|
|
|
|
|
by mindcrime
3406 days ago
|
|
For large datasets all the momentum seems to be moving towards Spark (sparklyr is RStudio's SparkR integration. Worst case, you can always use MPI with R and run on a Beowulf cluster. Of course that might not help if you want to use a function from a library, and the library itself expects everything to be in memory on one node, but at least it gives you another option for parallelization. |
|