by nchammas
859 days ago
There is something I don't get about the Polars DataFrame API.

https://docs.pola.rs/user-guide/migration/spark/

Look at the examples on this page comparing the Spark and Polars DataFrame APIs. (Disclaimer: I contributed this documentation. [1])

Having used SQL and Spark DataFrames heavily, but not Polars (or Pandas, for that matter), my impression is that Spark's DataFrame is analogous to a SQL table, whereas Polars's DataFrame is something a bit different, perhaps closer to a matrix.

I'm not sure how else to explain the kinds of operations you can perform in Polars that seem really weird coming from relational databases. I assume they are useful for something, but I'm not sure what. Perhaps machine learning?

[1]: https://github.com/pola-rs/polars-book/pull/113