Hacker News new | ask | show | jobs
by shardullavekar 93 days ago
has anyone come across an r2d3-style explainer for something as high-dimensional as a Transformer's attention mechanism?
1 comments