|
|
|
|
|
by kermatt
1044 days ago
|
|
Moving between Spark and Pandas can cause type casting as well. For example the range of allowable dates in Pandas is much smaller than in Spark. We completely abandoned Pandas in favor of PySpark for this reason. It seems unnecessary to use multiple dataframe implementations when Spark is already in play. |
|
https://pandas.pydata.org/docs/reference/api/pandas.Timestam...
(pandas design choice was to support nanosecond times, for financial data.)