|
|
|
|
|
by aratno
2048 days ago
|
|
ML-assisted query optimization is super interesting to me (I've had to fight the uphill battle against anomalies in Postgres stats and estimations[1]), but I'd also love to see more optimization and auto-tuning across the "stack": - Use column-oriented storage for columns that are frequently scanned in analytical queries - Automated index management, based on user-provided boundaries for write throughput, disk footprint, query latency - Calibrated optimism for heap updates. AFAIK, current DBs are either optimistic (assume transactions are more likely to commit than rollback) and update the heap in-place, and write the old value elsewhere in case of rollback, or pessimistically write all updates in new, versioned tuples and let MVCC (and garbage collection) handle the rest. Would be interesting to see the performance improvement that could come from modeling commit / rollback outcomes and optimizing accordingly. - Using variable-sized pages to reduce overhead for older pages, as a sort of light cold storage Anyone know of any DBs that automatically tune these aspects? [1]: http://www.vldb.org/pvldb/vol9/p204-leis.pdf |
|