|
|
|
|
|
by HarHarVeryFunny
656 days ago
|
|
> A transformer can perform any computation it likes in a forward pass No it can't. A transformer has a fixed number of layers - call it N. It performs N sequential steps of computation to derive it's output. If a computation requires > N steps, then a transformer most certainly can not perform it in a forward pass. FYI, "attention is all you need" has the implicit context of "if all you want to build is a language model". Attention is not all you need if what you actually want to build is a cognitive architecture. |
|
https://arxiv.org/abs/2310.02226
And again, human brains are clearly limited in the number of steps it can compute without writing something down. Limited =/ Trivial
>FYI, "attention is all you need" has the implicit context of "if all you want to build is a language model".
Great. Do you know what a "language model" is capable of in the limit ? No
These top research labs aren't only working on Transformers as they currently exist but it doesn't make much sense to abandon a golden goose before it has hit a wall.