Hacker News new | ask | show | jobs
by metalloid 1149 days ago
The author of the article should had provided an implementation of the transformer using only numpy or pure C++.
1 comments

I wrote a minimal implementation in NumPy here (the forward pass code is only 40 lines): https://github.com/jaymody/picoGPT

And also a related blog post: https://news.ycombinator.com/item?id=34726115

Although this is for a decoder-only transformer (aka GPT) and doesnt include the encoder part.

And I adapted Jay's work to Typescript (without the numpy obviously, just raw typescript/javascript): https://github.com/newhouseb/potatogpt