|
|
|
|
|
by Isinlor
528 days ago
|
|
Transformers are very bad at counting due to how their internals work. But if you ask them to use explicit counter the problem disappears: https://chatgpt.com/share/6775c9a6-8cec-8007-b709-3431e7a2b2... Basically one feed forward is not Turing complete, but autoregressive (feeding previous output back into itself) are Turing complete. |
|
Regardless, I’d love if you would explain a bit more why the transformer internals make this problem so difficult?