| I don't see nearly enough complexity in this analysis to justify the claim of having found any insurance fraud. Firstly, there's no account for correlation between the features identified. The article mentions VINs which have several single-vehicle accidents, for example, but someone who has one single-vehicle accident is probably more likely to have another. Switching coverage is another of those potentially-correlated features; if you claim and it bumps your premium, aren't you likely to shop around as a result? Secondly, there's no attempt to account for the law of large numbers. It's incredibly unlikely that someone has three single vehicle accidents in a year, but because the probability of that is nonzero, we know that with enough vehicles on the road then someone is going to do it. The article covers itself by acknowledging this, of course, but if you title your blog post "We Found Insurance Fraud in Our Crash Data" then you should actually do that. |