Hacker News new | ask | show | jobs
by tfinniga 3155 days ago
Right, like the example from Target. It is possible that facebook has access to a stream of Target purchases or credit card purchase information. I'd be 0% surprised if there was a data sharing agreement with credit rating companies.

Even if they just knew the price of the purchase, combined with the job (burn risk) and location data (traveled from work to nearest store with pharmacy during the day), it's possible that you might be able to infer a burn with enough accuracy to be valuable to advertisers.

6 comments

"It is possible that facebook has access to a stream of Target purchases" ...

It's not possible, it's simply true. In fact, this feature of the facebook ad platform is available to any advertiser on Facebook/Instagram/etc, whether you have $100 to spend or $100M.

You simply upload your customer's purchases to Facebook with data such as zip code, email, etc etc which Facebook then uses to optimize your advertising budget and find more customers similar to the ones you already have (the feature is called look-alike audiences).

"Google said that it captures around 70% of credit and debit card transactions in the US."

source: http://www.bbc.com/news/technology-40027706

If Google has access, I am sure Facebook does it.

I'm sure Facebook would like to do what you're saying, but I think that they're a long way from having those capabilities.

They have billions of users, billions of ads, and trillions of pieces of metadata. Sifting through that to produce a guess like "User 1234 burned themselves and would be interested in product 5678" would be an amazing and scary piece of AI.

Not to mention that Facebook are limited by the ads they can sell. The manufacturer of the burn cream doesn't give Facebook a bucket of money and free reign. They choose how and where the ads are shown. There is currently no option for "Users who recently burned themselves".

Last year there were 29,000 categories that could be targeted. If an ad sales rep wanted to close a deal, how difficult would it be to add one more for recent burn victims, or people who recently bought a specific product?

From the links that m_ke posted, facebook already has ties to loyalty card information, so it's very possible that they didn't need to do any inference, they just had the data directly.

In any case, the main point is that facebook doesn't need to listen to what people are saying, it has a ton of other data streams that could explain the stories in the link.

What would the point be in advertising a burn cream to someone that just bought a burn cream? It's not like it's a common reoccurring purchase (like Toothpaste for example).

Like the article says, it's probably just the Baader-Meinhof phenomenon* at work. It's like when you buy a new car - you suddenly start seeing that same model of car everywhere.

* https://www.damninteresting.com/the-baader-meinhof-phenomeno...

I think the proposed logic here is "a person like you buys burn cream so we advertise burn cream to people like you." In this scenario the person who actually bought the item will get the ad too.

To me, advertising a product to people who already bought it is a sound strategy. I see no contradiction. I'd rather like to hear a cogent argument as to why that is wasted effort, as is usually implied.

In your example, maybe I left the bottle of burn cream at the office. So I will want to get another one on the way home. It's effective to get reminded in my Facebook about that item (and the brand) isn't it? Yes it's creepy but it works.

> facebook doesn't need to listen to what people are saying, it has a ton of other data streams

The same could be said for any of those data streams. But just because Big F has access to those other sources of data doesn't mean it excludes voice recognition.

I'd be very surprised if FB wasn't in this arena. That's how big tech companies work these days — they do what everyone else is doing. Like getting into self-driving cars.

Apple[1], Google, Amazon, MS, etc... all have voice recognition boxes. FB doesn't need to market a box for your living room, it already has one in everyone's pocket.

([1] Apple says it doesn't use any of your Siri words against you. If you don't trust FB's word, you might no trust Apple's, either. But at least on the Mac you can keep dictation off the internet, if you choose.)

I agree that even with an enormous amount of data there are limits to what facebook can realistically infer...I particularly enjoyed the things that facebook’s ad platform determined to be my hobbies when I checked recently: https://m.imgur.com/nWCWn63

I’ve had an account since 2005, even if I use it much less than I used to, you’d think they’d have enough data to do better than that.

Think smaller. They don't need to consider billion users at once, for modeling you can start with hundreds of thousands or few million. Trillion pieces of metadata can be reduced to tens of thousands of features (eg. instead of feeding in the raw status updates, assign each update to some class, put in some mood measurement etc). This reduction does not need to be fully automated, it can be human assisted (humans coming up with ideas on what kind of features to extract). Instead of considering all possible ads, you can start by taking out the long tail of ads with very little interaction data and focus on some specific categories.

Building this kind of predictive models is not black magic. This is what companies do to figure out who they should target on direct marketing campaigns. Applying some magical deep learning dust can quite likely improve the results (in Facebook scale), but it's not mandatory. Computationally the hardest part is building the models. Once they are done, applying them to millions and millions of customers is more straight forward.

When considering what is possible, you need to also consider the stakes. Facebook ad revenue is something like $10 billion per quarter[1]. To make more money, I can think of three ways: add more users, make users spend more time looking at their feed or make more money per ad slot. Better targeting means more money per ad slot since some customers are paying directly based on the clicks. Since improvements lead to better revenue, extra hardware costs are easy to justify.

[1] http://www.adweek.com/digital/facebook-raked-in-9-16-billion...

Patterns like these can definitely be inferred by machine learning, with well principled models.

P(product_class=bandage | job=factory_worker, pharmacy_visit_last_month=1) >> P(product_class=bandage)

Also, Googling something on your phone is so common and easy nowadays that people just forget that they did it. It's literally become an integral part of the thinking process.
My suspicion on this is that stores offer "reward cards" so they can get you to sign something that allows them to gather and sell your data.
Rewards cards / loyalty card are designed to incentivize you to spend more with the stores. Ostensibly reselling data is a lot less valuable than directly trying to convince you to shop more at their establishments
That was true at the beginning of the loyalty card phage. But now the big chains are learning how to data mine and discovering the value of their data to others.

Now your loyalty card data is worth twice as much. Once to the retailer for internal marketing, targeting, purchasing, etc... and again selling it to outside parties.

thanks, sorry to reply to this late. I often wonder if I take a loyalty card and then for example I purchase twice as much double cream as the next person, does this get sold to health insurers who then charge me a higher premium? The permutations are endless eg too much beer. too many condoms. Too much spicy dip.
If you look at the history of the Tesco Clubcard (one of the early ones to appear in the UK), the data they mined from their new ability to tie purchasing trends to customers more than made up for the cost of the rewards being offered to get people to use it.

There's a reason they got so dominant over here - they were simply way ahead of the game on offering what their customers actually wanted, because they were the only people who actually had a clue.

That has of course changed now.

If you read the Walgreens Rewards privacy policy, it covers everything from retail transactions to security camera footage and they are able to tie everything together. They're also able to gather and sell data about your health, which would otherwise be protected by regulation using the rewards program, especially as people are incentivized to use the card, even for prescriptions...and no one actually reads [between the lines] of the privacy policy
I always thought they were up front about that being true. At least for the 1% off cards that supermarkets offer.
That's true and credit card companies are very open about it (at least if you read their reports for investors). The advantage of store cards is that the store knows exactly which products you bought, whereas they only know the total if you used a Mastercard/Visa.
>whereas they only know the total if you used a Mastercard/Visa

This is simply not true with respect to the big retailers in the US. This started as far back as '06, though the details of the implementation have changed a bit.

Only if it's a co-branded card. If you use your Visa issued by some bank to go shopping at Walmart, they won't get your personal details together with all items you bought. They can get the items but without the personal details (including age, wealth, etc), they're not that useful.
We had the ability to do this at the largest merchant acquirer 10 years ago as long as your bank signed up. Walmart was one of the first customers and would stream SKU-level data straight to our servers from the POS terminal.