Hacker News new | ask | show | jobs
by jaymody 1152 days ago
I wrote a minimal implementation in NumPy here (the forward pass code is only 40 lines): https://github.com/jaymody/picoGPT

And also a related blog post: https://news.ycombinator.com/item?id=34726115

Although this is for a decoder-only transformer (aka GPT) and doesnt include the encoder part.

1 comments

And I adapted Jay's work to Typescript (without the numpy obviously, just raw typescript/javascript): https://github.com/newhouseb/potatogpt