Hacker News
by RobinL 512 days ago
Has anyone tried to link and dedupe the various datasets using a probabilistic linkage tool like Splink?

https://moj-analytical-services.github.io/splink/

(Disclaimer: I am the lead author, but the tool is FOSS)

1 comment

We've used various methods over the years, but we'll check this one out. Thanks!