Hacker News new | ask | show | jobs
by YeGoblynQueenne 757 days ago
>> But this work shows in a controlled environment that the model can learn the principles of addition and extrapolate to much larger numbers.

No, because it's given hand-engineered embeddings that act as a strong inductive bias that is specific to addition. It's like addition is programmed right in.