Hacker News
by hermitcrab 1288 days ago
>R is extremely slow at a lot of tasks, for one thing, even more than Python.

Base R is quite slow, but R + data.table was faster than Python + Pandas in a benchmark I ran recently.

For a 1 million row CSV file, Read + Sort + self-Join + Write took the following times on a Windows box:

Base R: 47.56s

Python + Pandas: 6.44s

R + data.table: 2.99s

More details at:

https://www.easydatatransform.com/data_wrangling_etl_tools.h...
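
For anyone who wants to try something similar, here is a rough sketch of how the Python + Pandas leg of a Read + Sort + self-Join + Write test could be timed. It is not the script behind the numbers above. The column names (`id`, `value`), the synthetic data, the file names, and the `run_benchmark` helper are all assumptions made for illustration, and timings will vary with hardware, library versions, and the shape of the data.

```python
import os
import tempfile
import time

import numpy as np
import pandas as pd


def run_benchmark(n_rows, workdir):
    """Time Read + Sort + self-Join + Write on a synthetic CSV file.

    Returns (timings in seconds per stage, number of rows written).
    """
    src = os.path.join(workdir, "input.csv")
    dst = os.path.join(workdir, "output.csv")

    # Hypothetical schema: a unique integer key in shuffled order
    # plus one numeric column. Generating the file is not timed.
    rng = np.random.default_rng(0)
    pd.DataFrame({
        "id": rng.permutation(n_rows),
        "value": rng.random(n_rows),
    }).to_csv(src, index=False)

    timings = {}

    t0 = time.perf_counter()
    df = pd.read_csv(src)
    timings["read"] = time.perf_counter() - t0

    t0 = time.perf_counter()
    df = df.sort_values("id")
    timings["sort"] = time.perf_counter() - t0

    # Self-join on the unique key, so the output has one row per input row.
    t0 = time.perf_counter()
    joined = df.merge(df, on="id", suffixes=("_left", "_right"))
    timings["join"] = time.perf_counter() - t0

    t0 = time.perf_counter()
    joined.to_csv(dst, index=False)
    timings["write"] = time.perf_counter() - t0

    timings["total"] = sum(timings.values())
    return timings, len(joined)


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        timings, rows_out = run_benchmark(1_000_000, tmp)
    for stage, seconds in timings.items():
        print(f"{stage:>5}: {seconds:.2f}s")
    print(f"rows written: {rows_out}")
```

Timing each stage separately makes it easier to see whether CSV parsing, the sort, the join, or the write dominates, which is often where tools differ most.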


I think you've commented on the wrong thread. :)
Oops. That was meant to be a comment on the 'Every modeler is supposed to be a great Python programmer' thread.