by szundi 660 days ago
This is not “building from the ground up”

Neither the author of the GPT from scratch post nor eclectic29, who recommended it above, ever promised that the post is about building LLMs from the ground up. That was the original post's claim.

The GPT from scratch post explains, from the ground up (the ground being NumPy), what calculations take place inside a GPT model.

Inference is nothing without training.
Why is that bad?