|
|
|
|
|
by kostaj
17 days ago
|
|
That's a valid point. During the preliminary research, we did try also more explicit prompts (with explanation for each of the 4 buckets), as well as a five-bucket rubric (with Abstain option). Will show in a follow-up paper how the concise vs explicit prompt impacts the distribution of the verdicts and the level of disagreement. One issue to note with the longer prompts is that they open to much room for discussion around the exact prompt used. Probably we should preregister the prompt before running any further tests. |
|
If you let it spew out an explanation along with the answer, I'm curious if the accuracy will improve (I suspect it will).