|
|
|
|
|
by spathak
3714 days ago
|
|
Sumedh from Citus Data here. > Does Citus keeps it's performance over tables with tens of billions of records? Citus essentially shards the data across machines, and queries these in parallel. You can thus scale out your cluster and CPU cores as you add more data and maintain performance. > Also, how fast it is for ad-hoc queries over data coming from streams (Kafka/Kinesis) that has not been cached? By 'cached', do you mean OS or database caching in-memory? Query performance for on-disk data is as fast as you can get with regular PostgreSQL, since each data node is essentially a PostgreSQL node, and each shard a regular PostgreSQL table. Standard tuning like indexes and Postgres configuration parameters will apply here. |
|