|
|
|
|
|
by mattcollins
254 days ago
|
|
I'm the person who ran the test. The context I used in the test was pretty large. You'll see much better (near 100%) accuracy if you're using smaller amounts of context. [I chose the context size so that the LLM would be scoring in the ballpark of 50% accuracy (with variation between formats) to maximise the discriminative power of the test.] |
|