here: https://threadreaderapp.com/thread/1233220586358181888.html
and here: https://graphdeeplearning.github.io/post/transformers-are-gn...