by valine 394 days ago
So instead of next-token prediction, it's next-event prediction. At some point this just loops around and we're back to teaching models to predict the next token in the sequence.

Tokens are an awfully convenient way to describe an event.
Tokens are just discretized state representations.
It’s the next state. So instead of spitting out words, it will spit out a whole movie, or a sequence of world states in a game or simulation.