|
|
|
|
|
by HarHarVeryFunny
1113 days ago
|
|
Thanks, Jay! I wasn't aware of that BERT explainability paper - will be reading it, and watching your video. Are there any more recent Transformer Explainability papers that you would recommend - maybe ones that build on this and look at what's going on in later layers? |
|
Transformer Feed-Forward Layers Are Key-Value Memories https://arxiv.org/abs/2012.14913
The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns via Spotlights of Attention https://arxiv.org/abs/2202.05798
https://github.com/neelnanda-io/TransformerLens