| > Measuring is undervalued Goodhart's law has been a problem with measures. “When a measure becomes a target, it ceases to be a good measure.” The problem is that a competitive measure tends to overshadow the utility of the measure (look at anything that starts with a TPC- or the first rotation of a Tesla roadster tire). So the biggest impact of a measure is immediately after it is introduced, because the market hasn't engineering itself to overfit to that measure. And that is what Dan's post clearly says, that immediately after he brings up something, there is an actual change in the industry practice that happens. > it doesn't show off creativity. But something like Jepsen is amazingly useful because it is a creative nothing-up-my-sleeves end-run around first-party testing - actual third party testing for the product is the red-team to a QE team doing the blue-team work. The problem is that a lot of people leave that to the same vendor who is selling the product with its promises from their sales team - it is hardly an open book into the bugs open list (I work in open-source, so I have to wash my dirty laundry in public). As a side not to this, I had a meeting in the last 2 days where someone told me that the "QE team is doing a good job, because the test have been consistently green as the release approaches". And I didn't know exactly how to respond to that. |