Hacker News new | ask | show | jobs
by bckr 197 days ago
That’s where they take their big pile of data and train the model to do next-token-prediction.