Hacker News new | ask | show | jobs
by joennlae 766 days ago
Trainable Llama-like transformer (with backpropagation) in numpy only (~600 lines)

https://github.com/joennlae/tensorli

1 comments

The description says GPT-like, but is is just a GPT, right?
GPT refers to the specific family of models developed at OpenAI.
It also stands for generative pretrained transformer, which this seems to be.
It’s like saying SSD is a YOLO. Both are single shot object detectors, but only YOLO is “a YOLO”.