Hacker News new | ask | show | jobs
by Major_Grooves 1039 days ago
Indeed, unfortunately with the John Smiths of this world there will be false positive matches. What we could do is add that they need to be from the same town/postcode, but then that is quite an unreliable attribute too.

Similarly, there are a lot of false negatives where we know two records should match, but we could not because that would require a rule that would create more false positives.

In the end, it was the best we could do with the public view of the data. If we were working with the data Companies House actually holds itself, it would of course be much better.