Hacker News new | ask | show | jobs
by minimaxir 2543 days ago
In practice, if there's ever a case that there's "too much" data such that dplyr starts to hang (e.g. millions of rows, hundreds of columns), you would get better value by setting up a database first with the data. Which you can then query with dbplyr!
1 comments

data.table can handle millions of rows easily, as long as the data can fit in the memory.