|
|
|
|
|
by xnx
777 days ago
|
|
As a novice, is there a benefit to using custom Node as the downloader? When I did my download of the 40 million Hacker News api items I used "curl --parallel". What I would like to figure out is the easiest way to go from the API straight into a parquet file. |
|
As for the Arrow file, I'm not sure unfortunately. I imagine there are some difficulties because the format is columnar, so it probably wants a batch of rows (when writing) instead of one item at a time.