Hacker News new | ask | show | jobs
by whoateallthepy 1157 days ago
One thing to bear in mind is that these embedding vectors are high dimensional, so that it is entirely possible that the token embedding and position embedding are near-orthogonal to one another. As a result, information isn't necessarily lost.