| There is probabilistic vs. deterministic cross-device tracking. Deterministic assigns a unique device identifier to each device and then uses more data to connect device IDs to an individual. Probabilistic cross-device tracking uses machine learning algorithms to match up devices and identities. For this they can use basically any data that you happen to give them, including behavioral data (you check a website both at home and during transit on your mobile phone, you use the mouse to select text during reading an article, you accidentally gave an application access to your location data and they have resold this, etc.). Probabilistic cross-device tracking can work with and without cookies. Of course ad companies employing these techniques for their customers claim very optimistic accuracy, but know that the accuracy is at least accurate enough to provide them with useful tracking data on individuals. This accuracy will go up if you push ad companies in a corner and confront them with a 10% (or whatever marketshare Apple browsers have) non-cookie-able surfers (as opposed to a fringe small sample of users that block cookies and did so for years). When cookies got banned/required permission in Europe, European websites just started buggering everyone to accept cookies before you were able to read what you were coming for. While everyone already had the option to only allow cookies from trusted domains, now everybody gets pestered with giant pop-ups. Companies also switched to server-side analytics/tracking, or started requiring log-in to track you. If cookies were accepted, one could just join a cookie swap program to mess with the advertisers. Probabilistic cross-device tracking is very hard to avoid, as not using javascript and a general browser like TorBrowser is also an informative fingerprint. And you can't realistically change your browsing habits, which exposes you to gender and age identification (they need this to identify individuals in a household using a single IP). |
Is there any application of machine learning that isn't evil? Genuine question. It seems to be exclusively used to exploit people.