Hacker News new | ask | show | jobs
by verdverm 514 days ago
This is a riff on the original "attention is all you need" paper, there has been a few of these lately
1 comments

A few? A multitude.
This one might be right if they have in fact unified multiple attention approaches into a single framework

see Section 3.4