|
|
|
|
|
by mattcollins
254 days ago
|
|
I'm the person who ran the test. To hopefully clarify a bit... I intentionally chose input data large enough that the LLM would be scoring in the region of 50% accuracy in order to maximise the discriminative power of the test. |
|