Hacker News new | ask | show | jobs
by doug_durham 74 days ago
Duplicates are a sign of reality. Only where you have the resources to have dedicated people clean and organize data do you have well modeled data. Pandas is a power tool for making sense of real data.