|
|
|
|
|
by sixdimensional
124 days ago
|
|
I still don't understand what happened to using Apache Avro [1] for row-oriented fast write use cases. I think by now a lot of people know you can write to Avro and compact to Parquet, and that is a key area of development. I'm not sure of a great solution yet. Apache Iceberg tables can sit on top of Avro files as one of the storage engines/formats, in addition to Parquet or even the old ORC format. Apache Hudi[2] was looking into HTAP capabilities - writing in row store, and compacting or merge on read into column store in the background so you can get the best of both worlds. I don't know where they've ended up. [1] https://avro.apache.org/ [2] https://hudi.apache.org/ |
|