|
|
|
|
|
by swasheck
596 days ago
|
|
we use partitioned parquet files in s3. we use a csv in the bucket root to track the files. i’m sure there’s a better way but for now the 2tb of data are stored cheaply and we get fast reads by only reading the partitions we need to read. |
|
I feel like you'd get away with the whole thing for around $500/mo depending on how much compute was needed?