by modeless
1034 days ago
Isn't this the study that asked a bunch of questions with the same answer ("yes") and basically the old model always answered "yes" and the new model always answered "no"? That's not a degradation in performance. It was never answering the questions in the first place, just guessing. The only thing that changed was the default guess.
https://arxiv.org/pdf/2307.09009.pdf
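
To make the arithmetic concrete, here's a toy sketch. Everything in it is made up for illustration (the datasets, the `constant_guesser` helper, the sizes); it is not the paper's data or either real model. It just shows that when every correct answer is "yes", flipping a constant guess from "yes" to "no" swings measured accuracy from 100% to 0%, while on a balanced set both guessers score the same 50%.

```python
def accuracy(predictions, labels):
    """Fraction of predictions that match the labels."""
    return sum(p == y for p, y in zip(predictions, labels)) / len(labels)


def constant_guesser(answer):
    """Hypothetical 'model' that ignores the question and always returns one answer."""
    return lambda question: answer


def evaluate(model, dataset):
    """Run the model on every question and score it against the labels."""
    labels = [label for _, label in dataset]
    predictions = [model(question) for question, _ in dataset]
    return accuracy(predictions, labels)


# Made-up datasets of (question, correct_answer) pairs.
one_sided = [(n, "yes") for n in range(100)]  # every correct answer is "yes"
balanced = [(n, "yes" if n % 2 == 0 else "no") for n in range(100)]

always_yes = constant_guesser("yes")
always_no = constant_guesser("no")

# Each entry is (accuracy of always_yes, accuracy of always_no).
results = {
    "one_sided": (evaluate(always_yes, one_sided), evaluate(always_no, one_sided)),
    "balanced": (evaluate(always_yes, balanced), evaluate(always_no, balanced)),
}
print(results)
```

Neither guesser looks at the question, so the gap on the one-sided set says nothing about ability. It only reflects which default guess happens to line up with the labels.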