Hacker News
by ravjo 736 days ago
Sounds great. Non-engineer, but curious. Is there a walkthrough blog post or video that can help someone appreciate/understand this easily?
2 comments

Attention in transformers, visually explained | Chapter 6, Deep Learning - 3Blue1Brown: https://www.youtube.com/watch?v=eMlx5fFNoYc&t=
Thank you
Loosely related, but also a great read: https://distill.pub/2020/circuits/zoom-in/