[1] https://github.com/dedupeio/dedupe
[2] https://dedupe.io/
[3] http://www.cs.utexas.edu/~ml/papers/marlin-dissertation-06.p...