Hacker News
by xmcqdpt2 156 days ago
You can understand how transformers work just from reading the "Attention Is All You Need" paper, which is 15 pages of pretty accessible deep learning. That's not the part that is impressive about LLMs.