|
|
|
|
|
by knowtheory
3218 days ago
|
|
Nah, the paper explicitly states that their system is not recurrent nor convolutional: > To the best of our knowledge, however, the Transformer is the first transduction model relying
entirely on self-attention to compute representations of its input and output without using RNNs or
convolution. |
|