Hacker News new | ask | show | jobs
by xianshou 714 days ago
Illustrated Transformer is amazing as a way of understanding the original transformer architecture step-by-step, but if you want to truly visualize how information flows through a decoder-only architecture - from nanoGPT all the way up to a fully represented GPT-3 - nothing beats this:

https://bbycroft.net/llm

1 comments

whoa, that's awesome.