Hacker News new | ask | show | jobs
Wrapping your head around self-attention and multi-head attention (ash-01xor.github.io)
2 points by ashvanth 722 days ago