|
|
|
|
|
by mgraczyk
1716 days ago
|
|
I worked on the things you mentioned at Instagram, doing the tuning this-and-that for about a year on a few different surfaces including the home feed. You are right that for the most part, people look at metrics and make decisions about what to ship, and only the best engineers and data scientists spend time thinking about the actual product. However, you're wrong about what metrics are important. Since at least early 2020, and to some extent since 2016, there is a hard and enforced constraint on so-called "wellbeing" and "integrity" metrics. Facebook actively measures the sort of things reported in the WSJ piece (self-reported wellbeing) as well as many others like "bullying" (as measured by human reviewers), "known misinformation", "hate speech", etc. When engineers make changes to feed, they are generally not allowed to regress these non-engagement metrics. The focus of many shipping conversations is how to address even unmeasured potential risks to these metrics. A huge number of experiments are run specifically targeted at improving these metrics. |
|
The two concerns that come to mind without deeply understanding the problem is that: 1) Measuring a qualitative, nebulous metric like "wellbeing" (which could mean different things for different people) is likely very hard to do right 2) In my experience, things tend to move fast, and experiments often don't run for _that_ long. I would hypothesize that Facebook's negative effects on users is a compounding effect that emerges over the scale of months. Sure, you can leave a small % of users in a holdout group of your experiment, but how often is that getting revisited?
I do like the idea that there are teams out there that are taking it as a goal to positively move these non-engagement metrics. If FB is going to correct course then steps like this are a big part of that.