|
|
|
|
|
by karbarcca
2081 days ago
|
|
It can make huge amounts of difference in a production system; where I work, we process terabytes of csv data every day; saving minutes per file can add up to enormous differences in CPU cost/time for a production system running 24/7. I agree that for a data scientist doing exploratory analysis locally on their computer, it doesn't make nearly as much a difference (also because they're usually not working on crazy large files). The performance work in the CSV.jl package (that the article is about) was very much geared towards these kinds of production scenarios. |
|