|
|
|
|
|
by gwittel
1908 days ago
|
|
Lots of general reasons, inertia, etc. Often companies stick to 1-2 preferred technologies and adding Presto isn't seen as a gain (even though it helped Facebook quite a bit). I also suspect Amazon re-packing it as Athena reduced adoption to some extent. If looking at Presto (now Trino), the main thing to keep in mind is that you inherit the limitations of the underlying data store. Its best when the underlying store (+ the Db adaptor implementation) lets you parallelize work and keep each node busy, and avoid processing data unnecessarily. Hive/S3 columnar format data works great for this (IIRC this was a major early use case). Other sources like RDBMS will have natural limitations. Kafka has its own issues since each query generally means re-scanning a topic, etc. I see the data bridges as most useful as a way to bring data into the native/optimal format. Then do the heavy lift work in Presto. |
|