I think the point was that you need to ensure a representative sample and that requires an idea of the scale of the fraud taking place to be able to measure its impacts. I have no idea if that's possible without insider data or not, but I suspect that was the point of the argument, and not which substance was being tested for in the imaginary body of water being used as an example.