by jasonjmcghee 52 days ago
Highly recommend instead reading the human-created "The Illustrated GPT-2" by Jay Alammar: https://jalammar.github.io/illustrated-gpt2/

And his similar work.

He also has a free course on "how LLMs work".

1 comment

This Jay Alammar guide is great! I used it when writing my own transformer.