|
|
|
|
|
by arjenpdevries
1695 days ago
|
|
Arrow is meant to share data as-is instead of requiring a copy, and, often, serialization/deserialization. (This requires both ends to be able to handle the Arrow representation.) Eg, it has the potential to speed up query processing in PySpark by a lot, because of its Java/Python interoperability. |
|