|
|
|
|
|
by sandGorgon
2988 days ago
|
|
I don't think so. Python and data frames is arguably more natural to think about and reason than scala. I have no doubt that scala is more performant and the "fat" jar mechanism makes dependency management and codeshipping very easy (it's still tricky to install python dependencies on your spark nodes), but the pandas ecosystem is definitely more intuitive to understand. |
|