Hacker News
Attention Got So Efficient [GQA/MLA/DSA] [video] (youtube.com)
1 point by sameersegal 197 days ago