Hacker News new | ask | show | jobs
by closed 1441 days ago
Hey--I maintain a port of dplyr to python, called siuba[1]!

Right now it supports a decent number of verbs + SQL generation. I tried to break down why R users find pandas difficult in an RStudioConf talk last year[2].

Between siuba and tools like polars and duckdb, I'm hopeful that someone hits the data analysis sweet spot for python in the next couple years.

[1]: http://github.com/machow/siuba

[2]: https://youtu.be/w4Mi0u4urbQ

1 comments

I learned data analysis in the Hadley/Tidyverse way and constantly struggle when working with pandas. I'll try siuba this week at work.

Just one question, this runs over pandas? It's possible to get the pandas syntax like dbplyr gives the SQL query syntax?

Yeah, it runs over pandas or sql databases! Here's an example querying sql:

https://siuba.readthedocs.io/en/latest/intro.html#Working-wi...

You can use the verbs collect() and show_query() like in dbplyr.

If you DM me on twitter, would love to set up time to hear about your work / walk through siuba!

https://twitter.com/chowthedog