|
|
|
|
|
by mrtranscendence
1611 days ago
|
|
I stand corrected. The conversion felt relatively slow to me, but it was a large dataset and there were definitely missing values. Overall the benefits to speed and API cleanliness might be worth it, though it feels a bit gross to convert Spark to pandas to Polars to NumPy to DMatrix. That said, it’s so much better than pandas for data manip that I’ll probably still try to use it. Are you the author? If so, thanks for being so responsive on GitHub. You fixed basically every issue I had almost immediately back when I was learning Polars. It was awesome. |
|
But I will improve it. ;)