|
|
|
|
|
by zeitlupe
1184 days ago
|
|
Spark is my favorite tool to deal with jsons. It can read as many jsons – in any format located in any even nested folder structure – as you want, offers parallelization, and is great to flatten structs. I've never run into memory issues (or never ran out of workarounds) so far. |
|
Of course while you’re at it, you should probably just convert all your JSON into Parquet to speed up successive queries…