|
|
|
|
|
by YeGoblynQueenne
1596 days ago
|
|
The "Deep Symbolic Regression" paper reports very poor generalisation results that break off after a small n (where n is the number of tokens in the predicted sequence). It works some of the time for n = 1 (predicts the next token) but accuracy drops off for n = 10. No results are reported for N > 10 as far as I can tell in the "Out of Domain Generalization" section (which is the meat and potatoes of the "generalization" claim). tl;dr they can sometimes generalise to the next 1 to 10 tokens (digits or operators), but no more. This kind of short-term "generalisation" on OOD data is standard in neural nets trying to approximate symbolic regressions or things like grammars etc as far as I know. I do like they use 'Out of Domain" rather than "Out of Distribution" as a target though. That makes more sense. |
|
If you think you can do better than their program then:
Seq1: [0, 1, 2, 3, 6, 7, 13, 26, 32, 58, 116, 142, 258, 516]
Seq2: [2, 2, 3, 5, 10, 12, 22, 44, 54, 98, 196, 240, 436, 872]
Seq3: [3, 1, 8, 9, 18, 19, 37, 74, 92, 166, 332, 406, 738, 1476]
Their program is able to guess correct continuation with one more sequence element.
SHA1 hash for verification: bef5e213340f91258b3b9a0042c9c083dd91cb80