|
|
|
|
|
by int_19h
784 days ago
|
|
It takes less than an hour of conversation with either, giving them a few tasks requiring logical reasoning, to arrive at that conclusion. If that is a strong position, it's only because so many people seem to be buying the common scoreboards wholesale. |
|
I agree scoreboards are not a highly accurate ranking of model capabilities for a variety of reasons.