Hacker News new | ask | show | jobs
by benmanns 1477 days ago
What are people doing with entity resolution/record linkage? At Doximity we use it to match messy physician, hospital, and medical publication data from various sources into more coherent data sets to power profiles and research tools. Mostly with https://dedupe.io/ but with some custom tooling to handle larger (1m+ entities) datasets.
1 comments

Replacing sensitive entity content with pseudorandom seeded junk for subsequent training and transforms in exposed settings. Not the main use case for ER.