by whoateallthepy
1157 days ago
One thing to bear in mind is that these embedding vectors are high-dimensional, so it is entirely possible for the token embedding and the position embedding to be near-orthogonal to one another. If they are, adding them together does not force one to overwrite the other, so information isn't necessarily lost.
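A rough way to see why near-orthogonality becomes plausible in high dimensions: the sketch below samples pairs of random Gaussian vectors and measures their average absolute cosine similarity as the dimension grows. The random vectors are only a stand-in (learned token and position embeddings are not random draws), and the helper names `cosine_similarity` and `mean_abs_cosine` are made up for this illustration.

```python
import math
import random


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def mean_abs_cosine(dim, trials=200, seed=0):
    """Average |cosine| over `trials` pairs of random Gaussian vectors.

    Assumption: i.i.d. standard normal entries stand in for embeddings;
    this is not how real embeddings are distributed.
    """
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        u = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        total += abs(cosine_similarity(u, v))
    return total / trials


if __name__ == "__main__":
    # The average |cosine| should shrink as the dimension grows.
    for dim in (2, 16, 128, 768):
        print(dim, round(mean_abs_cosine(dim), 3))
```

For random vectors like these, the typical cosine falls off roughly as one over the square root of the dimension, so at a few hundred dimensions two unrelated vectors are already close to orthogonal. Whether a trained model actually keeps token and position information in near-orthogonal directions is an empirical question this sketch does not answer.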