Hacker News new | ask | show | jobs
by shaqbert 1369 days ago
The main impediment to companies adopting entity resolution tech is the incentive structure. Companies want to show growing user numbers, transactions, leads, order, etc. Alas if you look closely and sift out the dupes/frauds, your growth looks a lot less expressive. So why look closely?
1 comments

I think the point there is that you want to have clean data. Sure for you numbers it would look better if these are bigger - like for twitter and the bots... However you also need to see the other side - the operational one. If we have the same customer 5 times in your data, you will also target that customer 5 times with the same marketing initiative, you will have 5 times the costs etc.

Coming to compliance it even gets worse. If you have to answer a GDPR DSAR and you have 5 different records for one person, but only show one, then you can get into serious trouble with the authorities and also pay high fines.

So I think less high quality data is worth more than a lot of trash data.

here https://www.linkedin.com/feed/update/urn:li:activity:6931955... you can read more about the mentioned twitter ER problem.