|
|
|
|
|
by jari_mustonen
396 days ago
|
|
The gender bias is not primarily about LLMs but rather a reflection of the training material, which mirrors our culture. This is evident as the bias remains fairly consistent across different models. The bias toward the first presented candidate is interesting. The effect size for this bias is larger, and while it is generally consistent across models, there is an exception: Gemini 2.0. If things in the beginning of the prompt are considered "better", does this affect chat like interface where LLM would "weight" first messages to be more important? For example, I have some experience with Aider, where LLM seems to prefer the first version of a file that it has seen. |
|
As for gender bias being a reflection of training data, LLMs being likely to reproduce existing biases without being able to go back to a human who made the decision to correct it is a danger that was warned of years ago. Timnit Gebru was right, and now it seems that the increasing use of these systems will mean that the only way to counteract bias will be to measure and correct for disparate impact.