by vlovich123
935 days ago
I only skimmed your paper, so I may have missed it, but I didn't spot where it discusses overlapping the patches. Is it the section about the hybrid model, where the convolutional step de facto accomplishes it (maybe?) instead of explicitly overlapping patches?
|
ViT: https://github.com/SHI-Labs/Compact-Transformers/blob/main/s...
CCT: https://github.com/SHI-Labs/Compact-Transformers/blob/main/s...
Edit: Actually, here's a third-party version doing the permutation, then linear, then reshape operation:
https://github.com/lucidrains/vit-pytorch/blob/main/vit_pyto...
But the original implementation uses Conv: https://github.com/google-research/vision_transformer/blob/m...
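
For anyone wondering why the two styles are interchangeable, here's a minimal sketch in plain Python (not PyTorch, and not taken from any of the linked repos). The function names, the single-channel 8x8 image, and the toy sizes are all made up for illustration. It shows that cutting non-overlapping patches, flattening them, and applying a linear layer gives the same tokens as a convolution whose kernel size equals its stride, and that making the stride smaller than the kernel is what produces overlapping patches.

```python
import random

def patchify_linear(img, p, weights):
    """Cut non-overlapping p x p patches, flatten each, apply a linear map.

    weights holds D rows of p*p values, one row per output feature.
    """
    height, width = len(img), len(img[0])
    tokens = []
    for i in range(0, height, p):
        for j in range(0, width, p):
            flat = [img[i + di][j + dj] for di in range(p) for dj in range(p)]
            tokens.append([sum(w * x for w, x in zip(row, flat)) for row in weights])
    return tokens

def patchify_conv(img, p, kernels, stride):
    """Strided 2D convolution (cross-correlation, no padding, no bias).

    kernels holds D kernels of shape p x p; one token per window position.
    """
    height, width = len(img), len(img[0])
    tokens = []
    for i in range(0, height - p + 1, stride):
        for j in range(0, width - p + 1, stride):
            tokens.append([
                sum(k[di][dj] * img[i + di][j + dj]
                    for di in range(p) for dj in range(p))
                for k in kernels
            ])
    return tokens

random.seed(0)
p, dim, size = 4, 3, 8  # toy values chosen for this sketch
img = [[random.random() for _ in range(size)] for _ in range(size)]
weights = [[random.uniform(-1, 1) for _ in range(p * p)] for _ in range(dim)]

# Reshape each linear row into a p x p kernel so both paths share parameters.
kernels = [[row[r * p:(r + 1) * p] for r in range(p)] for row in weights]

tokens_linear = patchify_linear(img, p, weights)
tokens_conv = patchify_conv(img, p, kernels, stride=p)     # kernel == stride
tokens_overlap = patchify_conv(img, p, kernels, stride=2)  # stride < kernel

same = all(abs(a - b) < 1e-9
           for ta, tb in zip(tokens_linear, tokens_conv)
           for a, b in zip(ta, tb))
print(len(tokens_linear), len(tokens_conv), same)
print(len(tokens_overlap))
```

With kernel == stride the conv path visits exactly the same 4 windows as the flatten-and-linear path, so the choice between them is an implementation detail. With stride 2 the same 8x8 image yields 9 windows that share pixels with their neighbours, which is the overlapping case.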