Hacker News
by bjornsing 263 days ago
I’d say an LLM is a form of Markov chain, where the state is defined to consist of the entire context window, and each state transition adds one token to that context window.
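To make that concrete, here is a minimal toy sketch in Python. It is not a real LLM: `next_token_probs` is a made-up stand-in for the model's forward pass, and `VOCAB`, `WINDOW`, `step`, and `generate` are hypothetical names chosen for illustration. Dropping the oldest token once the window is full is my assumption about how a fixed-size window would be handled. The point is only that the next state depends on nothing but the current state (the window contents).

```python
import random

VOCAB = ("a", "b", "c")  # hypothetical toy vocabulary
WINDOW = 4               # hypothetical context window size, in tokens


def next_token_probs(state):
    """Toy stand-in for an LLM forward pass; depends only on `state`."""
    # Weight each token by how often it already appears in the window, plus 1.
    weights = [state.count(tok) + 1 for tok in VOCAB]
    total = sum(weights)
    return [w / total for w in weights]


def step(state, rng):
    """One Markov transition: sample a token and append it to the window."""
    token = rng.choices(VOCAB, weights=next_token_probs(state))[0]
    # Assumption: once the window is full, the oldest token falls out.
    return (state + (token,))[-WINDOW:]


def generate(state, n_steps, seed=0):
    """Run n_steps transitions starting from `state` (a tuple of tokens)."""
    rng = random.Random(seed)
    for _ in range(n_steps):
        state = step(state, rng)
    return state


if __name__ == "__main__":
    print(generate(("a",), 10))
```

Note that `step` takes only the current state (plus a source of randomness), never the earlier history. That is the Markov property in this framing.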