Hacker News new | ask | show | jobs
by tomrod 391 days ago
PySpark is a wrapper, so Scala is unnecessary and boggy.
1 comments

PySpark is great, except for UDF performance. This gap means that Scala is helpful for some Spark edge cases like column-level encryption/decryption with UDF