|
|
|
|
|
by cheez
2223 days ago
|
|
I read the post and had a couple of points: 1. Is DuckDB similar to having indexes on each column? Because generally when something is slow, the solution is indexes. I have a 100 GB database which records real time data and is lightning fast because of some minor tuning. 2. The example of STDDEV not be available shows the author's unfamiliarity with SQLite which worries me. https://docs.python.org/2.7/library/sqlite3.html#sqlite3.Con... Could very easily have made a similar interface if necessary. |
|
ad 1) An index is an additional data structure that takes space and needs maintenance when the data changes. This is especially problematic for realtime data. DuckDBs main data structure and implementation is designed for efficient scanning without needing an additional index ("vectorised columnar").
ad 2) We support STDDEV_SAMP and STDDEV_POP. There is also an interface to define your own functions. We are very aware of how SQLite is doing things and even support their client API. But we also might eventually add their way of adding UDFs from Python.