Hacker News
by isaacfung 709 days ago
Maybe it's easier to understand in the format of annotated code

https://nlp.seas.harvard.edu/2018/04/03/attention.html

1 comment

Updated link (in case the one above goes away): https://nlp.seas.harvard.edu/annotated-transformer/

Thanks for the original!