|
|
|
|
|
by mej10
3525 days ago
|
|
How fast are group bys? Like say I have 120 billion rows, 25 not-sparse columns, and want to group by between 2-20 columns (5 of which are varchar), aggregating the other 5 columns? What kind of hardware would I need to do that interactively? Or consistently sub-10 second, with 100s of queries per minute. I have built a thing on Redshift that can do some of this, but it has been new territory for me and I am not sure I've done it "right". Constantly looking for alternatives. |
|
You can also look at MemSQL for a distributed relational database with a columnstore. Run enough nodes and you might be able to hit your performance goals.