Y
Hacker News
new
|
ask
|
show
|
jobs
by
joennlae
766 days ago
Trainable Llama-like transformer (with backpropagation) in numpy only (~600 lines)
https://github.com/joennlae/tensorli
1 comments
Zambyte
765 days ago
The description says GPT-like, but is is just a GPT, right?
link
p1esk
765 days ago
GPT refers to the specific family of models developed at OpenAI.
link
Zambyte
765 days ago
It also stands for generative pretrained transformer, which this seems to be.
link
p1esk
765 days ago
It’s like saying SSD is a YOLO. Both are single shot object detectors, but only YOLO is “a YOLO”.
link