|
|
|
|
|
by cube2222
2 hours ago
|
|
Relatedly, I think it's worth noting that Anthropic models have consistently been top-scoring in BullshitBench[0], in a league of their own, really. Not affiliated with the bench in any way, but I think it surfaces important differences between the behavior of the models from different labs. TLDR: The benchmark is measuring pushback in response to nonsensical requests and questions, as opposed to going with it and hallucinating a nonsensical answer. [0]: https://petergpt.github.io/bullshit-benchmark/viewer/index.v... |
|