by sitkack
1285 days ago
No one uses that format for streamed JSON; see ndjson and jsonl: http://ndjson.org/ The size complaint is overblown, since repeated field names mostly disappear once the data is compressed. As other folks rightly commented, CSV is a minefield. One should assume every CSV file is broken in some way. They also don't enumerate any of the downsides of CSV. What people should consider is using formats like Avro or Parquet, which carry their schema with them, so the data can be loaded and analyzed without having to manually work out what each column means.
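To make the ndjson point concrete, here is a minimal sketch using only the Python standard library. The record fields (`sensor_id`, `temperature_c`, `status`) and the helper names `write_ndjson` and `read_ndjson` are made up for illustration, and the actual compression ratio will depend on your data.

```python
import gzip
import io
import json


def write_ndjson(records, stream):
    """Write one compact JSON object per line (the ndjson / jsonl layout)."""
    for record in records:
        stream.write(json.dumps(record, separators=(",", ":")) + "\n")


def read_ndjson(stream):
    """Yield records one line at a time, so the whole input never has to sit in memory."""
    for line in stream:
        line = line.strip()
        if line:  # skip blank lines
            yield json.loads(line)


# Hypothetical sample data: every record repeats the same field names.
records = [
    {"sensor_id": i % 10, "temperature_c": 20.0 + (i % 7), "status": "ok"}
    for i in range(1000)
]

buffer = io.StringIO()
write_ndjson(records, buffer)

# Compare the raw size with the gzip-compressed size of the same bytes.
raw = buffer.getvalue().encode("utf-8")
compressed = gzip.compress(raw)

# Read the stream back record by record.
buffer.seek(0)
round_tripped = list(read_ndjson(buffer))

print(f"raw: {len(raw)} bytes, gzip: {len(compressed)} bytes")
```

Each line parses on its own, so a consumer can start processing before the producer has finished writing, and the repeated keys are exactly the kind of redundancy gzip handles well.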