|
|
|
|
|
by SkiFire13
262 days ago
|
|
> Zonemap / skip indexes don’t require sorting That's still a separate index though, no? It's not intrinsic in the column storage itself, although I guess it works best with it if you end up having to do a full-scan of the column section anyway. > Sorting is even better, just at the cost of a second copy of the dataset.
> ...
> In a row-based rdbms, any indexing whatsoever is a copy of the column-data So the same thing, no? |
|
> So the same thing, no? Consider it as like: for a given filtered-query, a row-based storage is doing a table-scan if no index exists. There is no middle ground. Say 0% value or 100%.
A columnar database’s baseline is a decent index, and if there’s a sorted index then even better. Say 60% value vs 100%.
The relative importance of having a separate, explicit, sorted index is much lower in a columnar database, because the baseline is different. (Although maintaining extra sorted indexes is a columnar database is much more expensive — you basically have to keep a second copy of the entire table sorted on the new key(s))