| Came here to post this, fantastic paper and probably the most comprehensive thing on the internet available about this Something else I've taken away from research about columnar and vectorized databases: there doesn't seem a good reason why they aren't fit for OLTP workloads Analytics from SaaS/line-of-business apps shows about a 90/10 read/write ratio for CRUD apps. Pavlo et al. have a solid paper on an HTAP database that compares OLTP, OLAP, and a novel HTAP storage on varying read/write workloads: > "Arulraj, J., Pavlo, A., & Menon, P. (2016). Bridging the Archipelago between Row-Stores and Column-Stores for Hybrid Workloads. Proceedings of the 2016 International Conference on Management of Data - SIGMOD ā16. doi:10.1145/2882903.2915231 "
See "Figure 17: Concurrent Hybrid Workloads ā The impact of the storage layout on the query processing time under different concurrent hybrid workloads."The chart contains 4 workload types, one of which is "⢠Read-Heavy: 90% scans, 10% inserts" You can see that the columnar storage model outperforms OLTP N-ary storage here So my question is -- why don't people use columnar databases for CRUD apps? |