|
|
|
|
|
by phowon
2690 days ago
|
|
>Even transformers use RNN structures right. Nope. >How do you handle variable length input without something like an RNN? Any form of pooling, really. Max, Avg, Sum. The tricky part is how to do the pooling while still taking advantage of the sequential structure of the input information. The Transformer -based models have shown that you can get away with providing very little order information and still go very far. |
|