|
|
|
|
|
by xmaayy
1650 days ago
|
|
I think it's more likely that 5 came out because if it ever saw the answer, 105, before, it was split into the tokens [10][5] of which it only 'remembered' one. Or the numbers were masked when training (something that was done with BERT-like models) so it just knew enough to put a random one in |
|
What moved me to post is that that kind of silly answer is the exact sort of shenanigans that I would pull if I were cast as the control group in a Turing test.
I already do such things winkingly when talking with my preschooler to send him epistemic tracer rounds and see if he's listening critically