https://verticalserve.medium.com/group-query-attention-58283...
https://paperswithcode.com/method/multi-head-attention