by fifilura
771 days ago
Yeah, the big benefit is that it requires very little setup. You create a new partitioned table/location from the originally mapped file using a CTAS like so:

  -- Athena-style CTAS: partitioning is set in WITH (...), not after the SELECT,
  -- and the partition column(s) must be the last column(s) selected.
  CREATE TABLE new_table_name
  WITH (
    format = 'PARQUET',
    parquet_compression = 'SNAPPY',
    external_location = 's3://your-bucket/path/to/output/',
    partitioned_by = ARRAY['partition_column_name']
  ) AS
  SELECT *
  FROM original_table_name

If you don't already have a dimension to partition by, you can probably hash a column and partition by the last hex character of the hash, which gives you 16 roughly evenly sized partitions.
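
If you want to sanity-check the hash idea before running anything against real data, here is a rough Python sketch. The choice of MD5, the `bucket_for` helper, and the `row-N` keys are all my own assumptions for illustration; the only thing it shows is that the last hex character of a hash yields 16 buckets of similar size.

```python
import hashlib
from collections import Counter


def bucket_for(key: str) -> str:
    """Return the last hex character of the key's MD5 digest (one of 16 values)."""
    return hashlib.md5(key.encode("utf-8")).hexdigest()[-1]


# Hypothetical keys standing in for whatever column you decide to hash.
keys = [f"row-{i}" for i in range(100_000)]

# Count how many keys land in each of the 16 possible buckets.
counts = Counter(bucket_for(k) for k in keys)

for bucket in sorted(counts):
    print(bucket, counts[bucket])
```

In the query itself you would compute the same kind of value as an extra column, put it last in the select list, and name it as the partition column.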