Hacker News
by dapperdrake 365 days ago
Data cleaning depends on the problem domain.

Compare eliminating outliers from the output of a spectrometer (or spectrograph) vs. eliminating outliers from an almost linear process. The first will wreck your data, because the spikes are the signal; the second is the only correct thing to do.

         *         
**** ****
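
To make that concrete, here is a rough Python sketch, not anyone's production method. It applies the same generic filter to both kinds of data: fit a line, then drop points whose residual is far from the median residual, measured in units of the median absolute deviation (MAD). The function names, the threshold `k=5.0`, and both data sets are made up for illustration.

```python
from statistics import median


def fit_line(xs, ys):
    """Ordinary least-squares line fit; returns (slope, intercept)."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, my - slope * mx


def drop_outliers(xs, ys, k=5.0):
    """Keep points whose residual from a fitted line lies within
    k scaled MADs of the median residual. k is an arbitrary choice."""
    slope, intercept = fit_line(xs, ys)
    residuals = [y - (slope * x + intercept) for x, y in zip(xs, ys)]
    center = median(residuals)
    mad = median([abs(r - center) for r in residuals])
    scale = 1.4826 * mad  # MAD scaled to match a normal std. deviation
    if scale == 0:
        # No spread to judge against, so keep everything.
        return list(zip(xs, ys))
    return [
        (x, y)
        for x, y, r in zip(xs, ys, residuals)
        if abs(r - center) <= k * scale
    ]


# Case 1 (made-up data): almost linear process, one bad reading at x = 5.
noise = [0.1, -0.1, 0.05, -0.05, 0.1, -0.1, 0.05, -0.05, 0.1, -0.1]
lin_x = list(range(10))
lin_y = [2 * x + 1 + e for x, e in zip(lin_x, noise)]
lin_y[5] = 40.0
linear_kept = drop_outliers(lin_x, lin_y)

# Case 2 (made-up data): flat spectrum baseline with one sharp line at x = 10.
spec_x = list(range(21))
spec_y = [1.0 + (0.02 if x % 2 == 0 else -0.02) for x in spec_x]
spec_y[9], spec_y[10], spec_y[11] = 4.0, 12.0, 4.0
spectrum_kept = drop_outliers(spec_x, spec_y)

print(len(linear_kept), len(spectrum_kept))
```

On the linear data the filter drops only the bad reading. On the spectrum it drops the three points that make up the peak and keeps the baseline, which is exactly the wrong way round: the code is identical, and only the problem domain says whether the result is cleaning or destruction.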