Hacker News new | ask | show | jobs
by ShamelessC 1647 days ago
It is believed that the solutions neural networks learn may be rather elegant, to a degree. It is perhaps unscientific to say that the neural network is always decomposing and re-composing tokens in a _generic_ and intelligent way; but it's becoming increasingly obvious that it probably is; particularly in the case of the "self-attention" architectures.

https://openai.com/blog/multimodal-neurons/

> The concepts, therefore, form a simple algebra that behaves similarly to a linear probe.

Because the loss encourages words to be mapped as linearly independent vectors; you can literally do addition/subtraction with concepts and it sort of works.