by mangecoeur 3380 days ago
In my experience PySpark is much flakier and more annoying than doing parallel computing with more 'Python-native' tools. It only really makes sense once you have outgrown small clusters and really need huge infrastructure.

What python tools do you use for small clusters?
Dask would be an option.
Was going to say that. Or IPython Parallel (ipyparallel) if you want to go lower level.
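
For a rough idea of what the 'Python-native' style looks like, here is a minimal sketch using only the standard library's concurrent.futures as a stand-in. It is not Dask or ipyparallel code, and the helper names (chunked, sum_of_squares, parallel_sum_of_squares) are made up for illustration. Dask's distributed Client offers a similar futures-style submit/map interface, so the same split/map/combine shape should carry over to a small cluster.

```python
from concurrent.futures import ThreadPoolExecutor


def chunked(items, n_chunks):
    """Split items into at most n_chunks contiguous lists (hypothetical helper)."""
    items = list(items)
    size = max(1, -(-len(items) // n_chunks))  # ceiling division, at least 1
    return [items[i:i + size] for i in range(0, len(items), size)]


def sum_of_squares(chunk):
    """The per-chunk work; any picklable function would do in a real cluster."""
    return sum(x * x for x in chunk)


def parallel_sum_of_squares(items, n_chunks=4, executor_cls=ThreadPoolExecutor):
    """Map the work over chunks in a pool, then combine the partial results.

    A thread pool is used here only to keep the sketch simple; for CPU-bound
    work you would swap in a process pool or a cluster-backed executor.
    """
    with executor_cls(max_workers=n_chunks) as pool:
        partials = pool.map(sum_of_squares, chunked(items, n_chunks))
        return sum(partials)


if __name__ == "__main__":
    print(parallel_sum_of_squares(range(10), n_chunks=3))
```

The executor class is a parameter so the pool can be replaced without touching the split/map/combine logic.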