|
|
|
|
|
by mrfusion
2059 days ago
|
|
Every time I research transformers it seems so hand wavy. Is there a simple description, maybe a bit of pseudo code? Or at the other extreme they dump me into formula land without exposing what all the letters in the formula represent. |
|
And here's a super simple implementation of GPT by Andrej Karpathy. [link](https://github.com/karpathy/minGPT/blob/master/mingpt/model....)