|
|
|
|
|
by fenomas
1165 days ago
|
|
> Could the model invent reversi/othello had it not known about it beforehand? You've practically restated the paper's findings! :D The LLM knew nothing about othello; it wasn't shown any rules to be recombined. It was shown only sequences of 60 distinct tokens - effectively sentences in an unknown language. The LLM then inferred a model to predict the grammar of that language, and the authors demonstrated that its model functioned like an othello board. |
|
Literal quote from the paper:
"As a first step, we train a language model (a GPT variant we call Othello-GPT) to extend partial game transcripts (a list of moves made by players) with legal moves."
And then:
"Nonetheless, our model is able to generate legal Othello moves with high accuracy".
So:
- it knows about the game because it was literally shown the game with only the legal moves
- it doesn't produce legal moves all the time (even though it does so with high accuracy)
That's why I say "the work done is beyond any shadow of the doubt brilliant". Because this is a definite leap forward from the status quo. However, it doesn't imply that the models can invent/predict/come up with novel ways of doing something. This is still strictly within the realm of "given existing data, give back a statistically relevant response".
Could it actually invent Reversi/Othello had it not known about it beforehand?