| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by deshpand 1735 days ago

< I hate finding CSVs that other data scientists

Ideally you should be using the parquet format which will use the binary format, preserve column types and indexes [df.to_parquet(<file>); df = pd.read_parquet(<file>)]

You can get away from a lot of problems by simply avoiding text files