Hacker News new | ask | show | jobs
by karbarcca 2074 days ago
I agree; if I needed to parse CSVs in python and could utilize the arrow format, I would definitely use pyarrow.

I actually recently finished support for reading/writing the arrow format in Julia (https://github.com/JuliaData/Arrow.jl), and it's automatically integrated with the CSV.jl package; so you can do `Arrow.write("data.arrow", CSV.File("data.csv"))` and convert a csv file to arrow format directly. I'm very bullish on arrow as a standard binary data format for the future.