Hacker News
by thethimble 1299 days ago
What’s the advantage of learning this positional embedding vs hardcoding one that has the desired properties of location similarity?
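
For concreteness, here is a minimal sketch of what a hardcoded embedding with "location similarity" could look like. It assumes the fixed sinusoidal scheme from the original Transformer paper, which is not necessarily the alternative being asked about. The names `sinusoidal_encoding` and `similarity`, and the defaults `dim=64` and `base=10000.0`, are illustrative choices rather than anything from the linked work.

```python
import math

def sinusoidal_encoding(position, dim=64, base=10000.0):
    """Fixed (non-learned) encoding: sin/cos pairs at geometrically spaced frequencies."""
    vec = []
    for i in range(0, dim, 2):
        freq = 1.0 / (base ** (i / dim))
        vec.append(math.sin(position * freq))
        vec.append(math.cos(position * freq))
    return vec

def similarity(p, q, dim=64):
    """Dot product of two encodings, scaled so identical positions score 1.0."""
    a = sinusoidal_encoding(p, dim)
    b = sinusoidal_encoding(q, dim)
    return sum(x * y for x, y in zip(a, b)) / (dim / 2)

# Nearby positions should score higher than distant ones, and the score
# should depend only on the offset between the two positions.
near = similarity(10, 11)
far = similarity(10, 20)
shifted = similarity(100, 101)
print(near > far, abs(near - shifted) < 1e-9)
```

Since sin(a)sin(b) + cos(a)cos(b) = cos(a - b), the dot product here is a sum of cosines of the position offset, so similarity is built in with no parameters to train. A learned table would have to discover any such structure from data.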