Hacker News new | ask | show | jobs
by voidUpdate 7 days ago
Is that little computer training LLMs from scratch all by itself? That must take years to get any kind of progress, given the scale of training other providers do. Where do you get the training data from?
2 comments

You can train TinyStories in a few hours on retail hardware, and this is a highly illuminating experience that I can recommend for everyone.
Most of the training i am working on is with post training. You can do so much with a system that is running 24/7