| HN Mirror

Let's try and keep things separated:

1. The benchmark posted by the OP and the test results posted by Rozado are related but different.

2. Equal opportunity and equity (equal outcomes) are different.

Correcting LLM biases of the form shown by Rozado would absolutely be something the right supports, due to it having the chance of compromising equal opportunity, but this subthread is about GenderBench.

GenderBench views a model as defective if, when forced, it assumes things like an engineer is likely to be a man if no other information is given. This is a true fact about the world - a randomly sampled engineer is more likely to be a man than a woman. Stating this isn't viewed as wrong or immoral on the right, because the right doesn't care if gender ratios end up 50/50 or not as long as everyone was judged on their merits (which isn't quite the same thing as equal opportunity but is taken to be close enough in practice). The right believes that men and women are fundamentally different, and so there's no reason to expect equal outcomes should be the result of equal opportunities. Referring to an otherwise ambiguous engineer with "he" is therefore not being biased but being "based".

The left believes the opposite, because of a commitment to equity over equal opportunity. Mostly due to the belief that (a) equal outcomes are morally better than unequal outcomes, and (b) choice of words can influence people's choice of profession and thus by implication, apparently arbitrary choices in language use have a moral valence. True beliefs about the world are often described as "harmful stereotypes" in this worldview, implying either that they aren't really true or at least that stating them out loud should be taboo. Whereas to someone on the right it hardly makes sense to talk about stereotypes at all, let alone harmful ones - they would be more likely to talk about "common sense" or some other phrasing that implies a well known fact rather than some kind of illegitimate prejudice.

Rozado takes the view that LLMs having a built-in bias against men in its decision making is bad (a right wing take), whereas GenderBench believes the model should work towards equity (a left wing view). It says "We categorize the behaviors we quantify based on the type of harm they cause: Outcome disparity - Outcome disparity refers to unfair differences in outcomes across genders."

Edit: s/doctor/engineer/ as in Europe/NA doctor gender ratios are almost equal, it's only globally that it's male-skewed