|
|
|
|
|
by kostaj
16 days ago
|
|
Tried initially with a fifth bucket, Abstain. It was actually heavily used by some of the models. But it felt as if they are using this to "avoid" some of the hard questions, and we dropped this bucket to force them to provide a verdict. |
|
do you not see how that creates extremely misleading and valueless results? you are coercing the results into what you want to see.