Y
Hacker News
new
|
ask
|
show
|
jobs
by
bckr
197 days ago
That’s where they take their big pile of data and train the model to do next-token-prediction.