by hackerlight
818 days ago
I don't get how transformers can replace convolutional networks. My understanding is that patches get fed in and the transformer does the same thing a convolution layer does. But transformers deal with sequential data, and I don't see anything sequential here?
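
For concreteness, here is a rough sketch of the "patches get fed in" step as it is commonly described for vision transformers. This is an assumption about the setup being discussed, not something spelled out in this thread. The image is cut into a grid of patches, each patch is flattened into a vector, and the row-major list of those vectors is what gets treated as the sequence. The function name `image_to_patch_sequence` and the 4x4 toy image are hypothetical, chosen only for illustration.

```python
def image_to_patch_sequence(image, patch):
    """Split a 2-D image (list of rows) into a row-major list of flattened patches."""
    height, width = len(image), len(image[0])
    if height % patch or width % patch:
        raise ValueError("image sides must be divisible by the patch size")
    sequence = []
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            # Flatten one patch-by-patch block into a single token vector.
            token = [
                image[top + r][left + c]
                for r in range(patch)
                for c in range(patch)
            ]
            sequence.append(token)
    return sequence


# Hypothetical 4x4 single-channel "image" with pixel values 0..15.
image = [[row * 4 + col for col in range(4)] for row in range(4)]
tokens = image_to_patch_sequence(image, patch=2)

for position, token in enumerate(tokens):
    print(position, token)
```

Each entry in `tokens` plays the role a word would play in a sentence. In a real model each token is typically also linearly projected and given a position embedding, both of which this sketch leaves out.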