|
|
|
|
|
by jondwillis
308 days ago
|
|
I don’t think so. The training data, or some other filter applied to the output tokens, is resulting in each model indicating that it is the best. The self-preference is almost certainly coming from post-processing, or more likely because the model name is inserted into the system prompt. |
|