Hacker News new | ask | show | jobs
by bttger 1739 days ago
> Fully agree on Pandas. R's native data frame + tidyverse is world's easier. Pandas' overly complex indexing system is a persistent source of annoyance no matter how much I use that library.

Is it just the syntax/readability that annoys you, or are there actually problems that need like n steps more to do the same with Pandas?

1 comments

I spend more time working around panda's strange isms than it takes me to write vanilla python that does the same thing. The index problems are not just a small annoyances, and sometimes can waste hours because of its awkward defaults. For example, its default in df.to_csv to write an index (without a column name..)! It doesn't make any sense to me whatsoever that reading a csv, then writing the csv would add a new column. I'm really tired of rerunning pandas code after I forget to turn that stupid default index setting off. Is that a small thing? Sure. But it had tons of small things like that.
It's funny you complain about the index being saved in csv files, which is the default behaviour in R.
Not in tidyverse, but yeah indexes in pandas are a souped up version of rownames in base R.